per 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification ** 
C12N 15/54, 9/12 



A2 



(It) International Publication Number: WO 89/ 06691 

(43) International Publication Date: 27 July 1989 (27.07.89) 



(21) International Application Number: PCT/US89/00127 

(22) International Filing Date: 12 January 1989 (12.01.89) 

(31) Priority Application Number: 143,441 

(32) Priority Date: 12 January 1988(12.01.88) 

(33) Priority Country : US 



(71) Applicant: CETUS CORPORATION [US/US]; 1400 

Fifty-Third Street, Emeryville, CA 94608 (US). 

(72) Inventors: GELFAND, David, H. ; 6208 Chelton Drive, 

Oakland, CA 94611 (US). STOFFEL, Susanne ; 935 
Galvin Drive, El Cerrito, CA 94530 (US). LAWYER, 
Frances, C. : 6641 Saroni Drive, Oakland, CA 94611 
(US). SAIKL Randall, K. ; 320-39th Street, Rich- 
mond, CA 94805 (US). 



(74) Agent: HALLUIN, Albert, P.; Cetus Corporation, 1400 J 
Fifty-Third Street, Emeryville, CA 94608 (US). I 



(81) Designated States: AT (European patent), AU, BE (Eu- 
ropean patent), CH (European patent), DE (Euro- 
pean patent), DK, FI, FR (European patent), GB 
(European patent), HU, IT (European patent), JP, 
KR, LU (European patent), NL (European patent), 
NO, SE (European patent). 



Published 

Without international search report and to be repu- 
blished upon receipt of that report. 



(54) Title: PURIFIED THERMOSTABLE ENZYME 



(57) Abstract 



Recombinant DNA sequences encoding a thermostable DNA polymerase from Thermus aquaticus can be used to 
produce a recombinant protein with a molecular weight of about 86,000-95,000 daltons. The thermostable recombinant en- 
zyme can be used in a temperature-cycling chain reaction wherein at least one nucleic acid sequence is amplified in quanti- 
ty from an existing sequence with the aid of selected primers and nucleotide triphosphates. The enzyme is preferably 
stored in a buffer containing non-ionic detergents that lends stability to the enzyme. 




FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify Slates party to 
cations under the PCT. 



the PCT on the front pages of pamphlets publishing international appli- 



AT Austria 

AU Australia 

BB Barbados 

BE Belgium 

BG Bulgaria 

BJ Benin 

BR Brazil 

CF Central African Republic 

CG Congo 

CH Switzerland 

CM Cameroon 

DE Germany, Federal Republic of 

DK Denmark 



FR France 

GA Gabon 

GB United Kingdom 

HU Hungary 

IT Italy 

JP Japan 

KP Democratic People's Republic 

of Korea 

KR Republic of Korea 

LI Liechtenstein 

LK Sri Lanka 

LU Luxembourg 

MC Monaco 



ML Mali 

MR Mauritania 

MW Malawi 

NL Netherlands 

NO Norway 

RO Romania 

SD Sudan 

SE Sweden 

SN Senegal 

SU Soviet Union 

TD Chad 

TG Togo 

US United States of America 



WO 89/06691 



PCT/US89/00127 



1 

PURIFIED THERMOSTABLE ENZYME 

The present invention relates to a purified thermostable 
enzyme. In one embodiment the enzyme is DNA polymerase purified from 
Thermus aquaticus and has a molecular weight of about 86,000-95,000. 
5 In another embodiment the enzyme is DNA polymerase produced by 
recombinant means. 

Extensive research has been conducted on the isolation of 
DNA polymerases from mesophilic microorganisms such as E. coli . See, 
for example, Bessman et al. t J. Biol . Chem. (1957 ) 233^171-177 and 
10 Buttin and Kornberg (1966) J. Biol, Chem. 241 :5419-5427. 

In contrast, relatively little investigation has been made 
on the isolation and purification of DNA polymerases from 
thermophiles, such as Thermus aquaticus . Kaledin et al., Biokhymiya 
(1980) 45_:644-651 discloses a six-step isolation and purification 

15 procedure of DNA polymerase from cells of T^_ aquaticus YT1 strain. 
These steps involve isolation of crude extract, DEAE-cellulose 
chromatography, fractionation on hydroxyapati te, fractionation on 
DEAE-cellulose, and chromatography on single-strand DNA-cellulose. 
The pools from each stage were not screened for contaminating endo- 

20 and exonuclease(s). The molecular weight of the purified enzyme is 
reported as 62,000 daltons per monomeric unit. 

A second purification scheme for a pol unerase from T. 
aquaticus is described by A. Chien et al., J. Bacterid . (1976) 
127^:1550-1557. In this process, the crude extract is applied to a 

25 DEAE-Sephadex column. The dialyzed pooled fractions are then 
subjected to treatment on a phosphocellulose column. The pooled 
fractions are dialyzed and bovine serum albumin (BSA) is added to 
prevent loss of polymerase activity. The resulting mixture is loaded 
on a DNA-cellulose column. The pooled material from the column is 

30 dialyzed and analyzed by gel filtration to have a molecular weight of 
about 63,000 daltons, and, by sucrose gradient centrifugation of about 
68,000 daltons. 
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The use of a thermostable enzyme to amplify existing nucleic 
acid sequences in amounts that are large compared to the amount 
initially present has been suggested in U.S. Patent No. 4,683,195. 
Primers, nucleotide triphosphates, and a polymerase are used in the 
5 process, which involves denaturation, synthesis of template strands 
and hybridization. The extension product of each primer becomes a 
template for the production of the desired nucleic acid sequence. The 
patent discloses that if the polymerase employed is a thermostable 
enzyme, it need not be added after every denaturation step, because 

10 the heat will not destroy its activity. No other advantages or 
details are provided on the use of a purified thermostable DNA 
polymerase. Furthermore, New England Biolabs had marketed a 
polymerase from T\_ aquaticus , but was unaware that the polymerase 
activity decreased substantially with time in a storage buffer not 

15 containing non-ionic detergents. 

Accordingly, there is a desire in the art to produce a 
purified, stable thermostable enzyme that may be used to improve the 
nucleic acid amplification process described above. 

Accordingly, the present invention provides a purified 
20 thermostable enzyme that catalyzes combination of nucleotide 
triphosphates to form a nucleic acid strand complementary to a nucleic 
acid template strand. Preferably the purified enzyme is DNA 
polymerase from Thermus aquaticus and has a molecular weight of about 
86,000-95,000 daltons. This purified material may be used in a 
25 temperature-cycling amplification reaction wherein nucleic acid 
sequences are produced from a given nucleic acid sequence in amounts 
that are large compared to the amount initially present so that they 
can be manipulated and/or analyzed easily. 

The gene encoding the ONA polymerase enzyme from Thermus 
30 aquaticus has also been identified and cloned and provides yet another 
means to prepare the thermostable enzyme of the present invention. In 
addition to the gene encoding the approximately 86-000-95,000 dalton 
enzyme, gene derivatives encoding DNA polymerase activity are also 
presented. 
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The invention also encompasses a stable enzyme composition 
comprising a purified, thermostable enzyme as described above in a 
buffer containing one or more non- ionic polymeric detergents. 

Finally, the invention provides a method of purification for 
^ the thermostable polymerase of the invention which comprises treating 
an aqueous mixture containing the thermostable polymerase with a 
hydrophobic interaction chromatographic support under conditions which 
promote hydrophobic interactions and eluting the bound thermostable 
polymerase from said support with a solvent which attenuates 
10 hydrophobic interactions. 

The purified enzyme, as well as the enzymes produced by 
recombinant DNA techniques, provides much more specificity than the 
Klenow fragment, which is not thermostable, when used in the 
temperature-cycling amplification reaction. In addition, the purified 

13 enzyme and the recombinants produced enzymes exhibit the appropriate 
activity expected when TTP or other nucleotide triphosphates are not 
present in the incubation mixture with the DNA template. Also, the 
enzymes herein have a broader pH profile than that of the thermostable 
enzyme from Tjiermus aquaticus described in the literature, with more 

20 than 502 of the activity at pH 6.4 as at pH 8. 

Figure 1 is the DNA sequence and the predicted amino acid 
sequence for Taq polymerase. The amino acid sequence corresponding to 
the deduced primary translation product is numbered 1-832. 

« 

Figure 2 is. a restriction site map of plasmid pFC83 that 
contains the ^4.5 kb HindHI T. aquaticus DNA insert subcloned into 
plasmid BSM13+. 

Figure 3 is a restriction site map of plasmid pFC85 that 
contains the ~2.68 kb Hind I II to Asp718 T,_ aquaticus DNA insert 
subcloned into plasmid BSM13+. 

As used herein, "cell 0 , "cell line", and "cell culture" can 
be used interchangeably and all such designations include progeny. 
Thus, the words "transformants" or "transformed cells" includes the 
primary, subject cell and cultures derived therefrom without regard for 
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the number of transfers. It is also understood that all progeny may 
not be precisely identical in DNA content, due to deliberate or 
inadvertent mutations. Mutant progeny that have the same 
functionality as screened for in the originally transformed cell are 
5 included. 

The term "control sequences" refers to DNA sequences 
necessary for the expression of an operably linked coding sequence in 
a particular host organism. The control sequences that are suitable 
for procaryotes, for example, include a promoter, optionally an 
10 operator sequence, a ribosome binding site, and possibly, other as yet 
poorly understood sequences. Eucaryotic cells are known to utilize 
promoters, polyadenyl ation signals, and enhancers. 

The term "expression system" refers to DNA sequences 
containing a desired coding sequence and control sequences in operable 
13 linkage, so that hosts transformed with these sequences are capable of 
producing the encoded proteins. In order to effect transformation, 
the expression system may be included on a vector; however, the 
relevant DNA may then also be integrated into the host chromosome. 

The term "gene" as used herein refers to a DNA sequence that 
20 encodes a recoverable bioactive polypeptide or precursor. The 
polypeptide can be encoded by a full-length gene sequence or any 
portion of the coding sequence so long as the enzymatic activity is 
retained. 

« 

In one embodiment of tha invention, the DNA sequence 
25 encoding a full-length thermostable DNA polymerase of Thermus 
aquaticus (Taq) is provided. Figure 1 shows this DNA sequence and the 
deduced amino acid sequence. For convenience, the amino acid sequence 
of this Taq polymerase will be used as a reference and other forms of 
the thermostable enzyme will be designated by referring to the 
30 sequence shown in Figure 1. Since the N-terminal methionine may or 
may not be present, both forms are included in all cases wherein the 
thermostable enzyme is produced in bacteria. 

"Operably linked" refers to juxtaposition such that the 
normal function of the components can be performed. Thus, a coding 
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sequence "operably linked" to control sequences refers to a 
configuration wherein the coding sequences can be expressed under the 
control of the control sequences. 

The terra ■mixture" as it relates to mixtures containing Taq 
5 polymerase refers to a collection of materials which includes Taq 
polymerase but which also includes alternative proteins. If the Taq 
polymerase is derived from recombinant host cells, the other proteins 
will ordinarily be those associated with the host. Where the host is 
bacterial, the comtaminating proteins will, of course, be bacterial 
10 proteins. 

"Non-ionic polymeric detergents" refers to surface-active 
agents that have no ionic charge and that are characterized, for- 
purposes of this invention, by their ability to stabilize the enzyme 
herein at a pH range of from about 3.5 to about 9.5, preferably from 4 
15 to 8.5. 

The term "oligonucleotide" as used herein is defined as a 
molecule comprised of two or more deoxyribonucleotides or 
ribonucleotides, preferably, more than three. Its exact size will 
depend on many factors, which in turn depend on the ultimate function 
or use of the oligonucleotide. The oligonucleotide may be derived 
synthetically or by cloning. 

The term "primer" as used herein refers to an 
oligonucleotide, whether occurring naturally as in a purified 
restriction digest or produced synthetically, which is capable of 
25 acting as a point of initiation of synthesis when placed under 
conditions in which synthesis of a primer extension product which is 
complementary to a nucleic acid strand is initiated, i.e., in the 
presence of four different nucleotide triphosphates and thermostable 
enzyme in an appropriate buffer ('buffer" includes pH, ionic strength, 
cofactors, etc.) and at a suitable temperature. For Taq polymerase 
the buffer herein preferably contains 1.5-2 mH of a magnesium salt, 
preferably MgCl 2 , 150-200^ of each nucleotide, and lyuM of each 
primer, along with preferably 50 mM KC1, 10 mM Tris buffer, pH 8-8.4, 
and lOO/fg/ml gelatin. 
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The primer is preferably si ngl e- stranded for maximum 
efficiency in amplification, but may alternatively be double- 
stranded. If double- stranded, the primer is first treated to separate 
its strands before being used to prepare extension products* 

5 Preferably, the primer is an oligodeoxyribonucleotide. The primer 
must be sufficiently long to prime the synthesis of extension products 
in the presence of the thermostable enzyme. The exact lengths of the 
primers will depend on many factors, including temperature, source of 
primer and use of the method. For example, depending on the 

10 complexity of the target sequence, the oligonucleotide primer 
typically contains 15-25 nucleotides, although it may contain more or 
fewer nucleotides. Short primer molecules generally require cooler 
temperatures to form sufficiently stable hybrid complexes with 
template. 

15 The primers herein are selected to be "substantially" 

complementary to the different strands of each specific sequence to be 
amplified. This means that the primers must be sufficiently 
complementary to hybridize with their respective strands. Therefore, 
the primer sequence need "not reflect the exact sequence of the 
20 template. For example, a non-complementary nucleotide fragment may be 
attached to the 5' end of the primer, with the remainder of the primer 
sequence being complementary to the strand. Alternatively, non- 
complementary bases or longer sequences can be interspersed into the 
primer, provided 'that the primer sequence has sufficient 

25 complementarity with the sequence of the strand to be amplified to 
hybridize therewith and thereby form a template for synthesis of the 
extension product of the other primer. However, for detection 
purposes, particulary using labeled sequence-specific probes, the 
primers typically have exact complementarity to obtain the best 

30 results. 

As used herein, the terms "restriction endonucl eases" and 
"restriction enzymes" refer to bacterial enzymes each of which cut 
double-stranded DNA at or near a specific nucleotide sequence. 
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As used herein, the term "thermostable enzyme" refers to an 
enzyme which is stable to heat and is heat resistant and catalyzes 
(facilitates) combination of the nucleotides in the proper manner to 
form the primer extension products that are complementary to each 
3 nucleic acid strand. Generally, the synthesis will be initiated at 
the 3' end of each primer and will proceed in the 5 1 direction along 
the template strand, until synthesis terminates, producing molecules 
of different lengths. There may be a thermostable enzyme, however, 
which initiates synthesis at the 5' end and proceeds in the other 
10 direction, using the same process as described above. 

The thermostable enzyme herein must satisfy a single 
criterion to be effective for the amplification reaction, i.e., the 
enzyme must not become irreversibly denatured (inactivated) when 
subjected to the elevated temperatures for the time necessary to 

15 effect denaturation of double-stranded nucleic acids. Irreversible 
denaturation for purposes herein refers to permanent and complete loss 
of enzymatic activity. The heating conditions necessary for nucleic 
acid denaturation will depend, e.g., on the buffer salt concentration 
and composition and the Vength and nucleotide composition of the 

20 nucleic acids being denatured, but typically range from about 90 to 
about 105 C for a time depending mainly on the temperature and the 
nucleic acid length, typically about 0.5 to four minutes. Higher 
temperatures may be tolerated as the buffer salt concentration and/or 
GC composition of the nucleic acid is increased. Preferably, the 

25 enzyme will not become irreversibly denatured at about 90-100*C. 

The thermostable enzyme herein preferably has an optimum 
temperature at which it functions that is higher than about 40°C, 
which is the temperature below which hybridization of primer to 
template is promoted, although, depending on (1) salt concentration 

30 and composition and (2) composition and length of primer, 
hybridization can occur at higher temperature (e.g., 45-70°C). The 
higher the temperature optimum for the enzyme, the greater the 
specificity and/or selectivity of the primer-directed extension 
process. However, enz>raes that are active below 40*C, e.g., at 37 a C, 

35 are also within the scope of this invention provided they are heat- 
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stable* Preferably, the optimum temperature ranges from about 50 to 
90°C, more preferably 60-80°C. 

The thermostable enzyme herein may be obtained from any 
source and may be a native or recombinant protein. Examples of 
5 enzymes that have been reported in the literature as being resistant 
to heat include heat-stable polymerases, such as, e.g., polymerases 
extracted from the thermophilic bacteria Therm us flavus , Thermus 
ruber , Thermus thermophilus , Bacillus stearothermophilus (which has a 
somewhat lower temperature optimum than the others listed), Thermus 
10 aquaticus , Thermus lacteus t Thermus rubens, and Methanothermus 
fervidus . In addition, thermostable polymerases isolated from the 
thermophilic archaebacteria include, for example, Sulfolobus 
sol fataricus , Sul fol obus aci docal darius , Thermo pi asma acidophil um t 
Methano bacterium thermoautotrophicum , and Oesul furococcus mobilis . 

15 The thermostable enzyme of the invention has the amino acid 

sequence presented in Figure 1. In addition, any thermostable 
polymerase containing at least 501 homology to any contiguous stretch 
of nine or more amino acids presented therein is also intended to be 
within the scope of the invention. This homology can be determined 

20 using commercially available data banks such as the European Molecular 
Biology Laboratory (EMBL) or Genbank. Moreover, as new thermostable 
polymerases are identified, specific regions of homology between the 
newly identified sequences and the Taq polymerase sequence may be 
determined using, for example, the Sequence Analysis Software Package 

25 of the Genetics Computer Group of the University of Wisconsin. 
Specific regions of homology include the following sequences (numbered 
according to the numbering of amino acids in Figure 1): residues 190- 
204, 262-270, 569-587, 718-732, 743-759, and 778-790. 

The preferred thermostable enzyme herein is a DNA polymerase 
30 isolated from Thermus aquaticus . Various strains thereof are 
available from the American Type Culture Collection, Rockville, 
Maryland, and is described by T.D. Brock, J. Bact. (196 9) 98 :2 89-2 97 , 
and by T. Oshima, Arch. Microbiol. (1978) 117 : 189-196. One of these 
preferred strains is strain YT-1. 
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For recovering the native protein the cells are grown using 
any suitable technique. One such technique is described by Kaledin et 
al. t Biokhimiya (1980), supra , the disclosure of which is incorporated 
herein by reference. Briefly, the cells are grown on a medium, in one 
5 liter, of nitrilotriacetic acid (100 mg), tryptone (3 g), yeast 
extract (3 g), succinic acid (5 g), sodium sulfite (50 mg), riboflavin 
(1 mg), K 2 HP0 4 (522 mg). MgS0 4 ( 480 mg), CaCl 2 (222 mg), NaCl (20 mg), 
and trace elements. The pH of the medium is adjusted to 8.0 ± 0.2 
with K0H. The yield is increased up to 20 grams of cells/liter if 
10 cultivated with vigorous aeration at a temperature of 70*0. Cells in 
the late logarithmic growth stage (determined by absorbance at 550 nm) 
are collected by centrifugation, washed with a buffer and stored 
frozen at -20°C. 

In another method for growing the cells, described in Chien 
13 et al. f J. Bacteriol. (1976), supra t the disclosure of which is 
incorporated herein by reference, a defined mineral salts medium 
containing 0.3X glutamic acid supplemented with 0.1 mg/1 biotin, 0.1 
mg/1 thiamine, and 0,05 mg/1 nicotinic acid is employed. The salts 
include nitrilotriacetic acid, CaS0 4 , MgS0 4 , NaCl, KN0 3 , NaN0«j , ZnS0 4 , 
20 H3BO3, CuS0 4 , NaMo0 4 , CoCl 2 , FeCl 3 , MnS0 4 , and Na 2 HP0 4 . The pH of the 
medium is adjusted to 8.0 with NaOH. 

In the Chien et al. technique, the cells are grown initially 
at 75 C in a water bath shaker. On reaching a certain density, 1 
liter of these cells is transferred to 16-liter carboys which are 
23 placed in hot-air incubators. Sterile air is bubbled through the 
cultures and the temperature maintained at 75* C. The cells are 
allowed to grow for 20 hours before being collected by centrifuge. 

After cell growth, the isolation and purification of the 
enzyme take place in six stages, each of which is carried out at a 
30 temperature below room temperature, preferably about 4*C. 

In the first stage or step, the cells, if frozen, are 
thawed, disintegrated by ultrasound, suspended in a buffer at about pH 
7.5, and centrifuged. 
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In the second stage, the supernatant is collected and then 
fractionated by adding a salt such as dry amnonium sulfate. The 
appropriate fraction (typically 45-751 of saturation) is collected, 
dissolved in a 0.2 M potassium phosphate buffer preferably at pH 6.5, 
5 and dialyzed against the same buffer. 

The third step removes nucleic acids and some protein. The 
fraction from the second stage is applied to a DEAE-cellulose column 
equilibrated with the same buffer as used above. Then the column is 
washed with the same buffer and the flow-through protein-containing 
10 fractions, determined by absorbance at 280 nm, are collected and 
dialyzed against a 10 mM potassium phosphate buffer, preferably with 
the same ingredients as the first buffer, but at a pH of 7.5. 

In the fourth step, the fraction so collected is applied to 
a hydroxyapati te column equilibrated with the buffer used for dialysis 

15 in the third step. The column is then washed and the enzyme eluted 
with a linear gradient of a buffer such as 0.01 M to 0.5 M potassium 
phosphate buffer at pH 7.5 containing 10 mM 2-mercaptoethanol and 52 
glycerine. The pooled fractions containing thermostable enzyme (e.g., 
DNA polymerase) activity are dialyzed against the same buffer used for 

20 dialysis in the third step. 

In the fifth stage, the dialyzed fraction is applied to a 
DEAE-cellulose column, equilibrated with the buffer used for dialysis 
in the third step. m The column is then washed and the enzyme eluted 
with a linear gradient of a buffer such as 0.01 to 0.6 M KC1 in the 

25 buffer used for dialysis in the third step. Fractions with 
thermostable enzyme activity are then tested for contaminating 
deoxyribonucl eases (endo- and exonucleases ) using any suitable 
procedure. For example, the endonuclease activity may be determined 
electrophoretically from the change in molecular weight of phage 

30 lambda DNA or supercoiled plasmid DNA after incubation with an excess 
of DNA pol>roerase. Similarly, exonuclease activity may be determined 
electrophoretically from the change in molecular weight of DNA after 
treatment with a restriction enzyme that cleaves at several sites. 
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The fractions determined to have no deoxyribonuclease 
activity are pooled and dialyzed against the same buffer used in the 
third step. 

In the sixth step, the pooled fractions are placed on a 
5 phosphocellulose column with a set bed volume. The column is washed 
and the enzyme eluted with a linear gradient of a buffer such as 0.01 
to 0.4 M KC1 in a potassium phosphate buffer at pH 7.5. The pooled 
fractions having thermostable polymerase activity and no 
deoxyribonuclease activity are dialyzed against a buffer at pH 8.0. 

10 Tte molecular weight of the dialyzed product may be 

determined by any technique, for example, by SDS-PAGE analysis using 
protein molecular weight markers. The molecular weight of one of the 
preferred enzymes herein, the DNA polymerase purified from Thermus 
aquaticus. is determined by the above method to be about 86,000-90,000 

13 daltons. The molecular weight of this same DNA polymerase as 
determined by the predicted amino acid sequence is calculated to be 
approximately 94,000 daltons. Thus, the molecular weight of the full 
length DNA polymerase is dependent upon the method employed to 
determine this number and falls within the range of 86,000-95,000 

20 daltons. 

The thermostable enzyme of this invention may also be 
produced by recombinant DNA techniques, as the gene encoding this 
enzyme has been cloned from The-mus aquaticus genomic DNA. The 
complete coding sequence for the Thermus aquaticus (Taq) polymerase 
can be derived from bacteriophage CH35:Taq#4-2 on an approximately 3.5 
kilobase (kb) Bgni-Asp718 (partial) restriction fragment contained 
within an *»18 kb genomic DNA insert fragment. This bacteriophage was 
deposited with the American Type Culture Collection (ATCC) on May 29. 
1987 and has accession no. 40,366. Alternatively, the gene can be 
constructed by ligating an ~ 730 base pair (bp) Bql II-HindIII 
restriction fragment isolated from plasraid pFC83 (ATCC 67,422 
deposited May 29, 1987) to an <v2.68 kb HindIII-Asp718 restriction 
fragment isolated from plasmid pFC85 (ATCC 67,421 deposited May 29, 
1987). The pFC83 restriction fragment comprises the ami no- terminus of 
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the Taq polymerase gene while the restriction fragment from pFC85 
comprises the carboxy- terminus. Thus, ligation of these two fragnents 
into a correspondingly digested vector with appropriate control 
sequences will result in the translation of a full-length Taq 
5 polymerase. 

As stated previously, the DNA and deduced amino acid 
sequence of a preferred thermostable enzyme is provided in Figure 1. 
In addition to the N-terminal deletion described supra , it has also 
been found that the entire coding sequence of the Taq polymerase gene 
10 *is not required to recover a biologically active gene product with DNA 
polymerase activity. Amino- terminal deletions wherein approximately 
one-third of the coding sequence is absent has resulted in producing a 
gene product that is quite active in polymerase assays. 

In addition to the N-terminal deletions, individual amino 
15 acid residues in the peptide chain comprising Taq polymerase may be 
modified by oxidation, reduction, or other derivatization, and the 
protein may be cleaved to obtain fragments that retain activity. Such 
alterations that do not destroy activity do not remove the protein 
from the definition, and are specifically included. 

20 Thus, modifications to the primary structure itself by 

deletion, addition, or alteration of the amino acids incorporated into 
the sequence during translation can be made without destroying the 
high temperature DNA polymerase activity of the protein. Such 
substitutions or other alterations result in proteins having an amino 
25 acid sequence encoded by DNA falling within the contemplated scope of 
the present invention. 

Polyclonal antiserum from rabbits inmunized with the 
purified 86,000-95,000 dalton polymerase of this invention was used to 
probe a Thermus aquaticus partial genomic expression library to obtain 
30 the appropriate coding sequence as described below. The cloned 
genomic sequence can be expressed as a fusion polypeptide, expressed 
directly Using its own control sequences, or expressed by 
constructions using control sequences appropriate to the particular 
host used for expression of the enzyme. 
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Of course, the availability of DNA encoding these sequences 
provides the opportunity to modify the codon sequence so as to 
generate mutein (mutant protein) forms also having DNA polymerase 
activity. 

3 Thus, these tools can provide the complete coding sequence 

for Taq DNA polymerase from which expression vectors applicable to a 
variety of host systems can be constructed and the coding sequence 
expressed. Portions of the Taq pol ymerase-encoding sequence are 
useful as probes to retrieve other thermostable polymerase-encoding 
sequences in a variety of species. Accordingly, portions of the 
15 genomic DNA encoding at least four to six amino acids can be 
replicated in coli and the denatured forms used as probes or 
oligodeoxyribonucleotide probes can be synthesized which encode at 
least four to six amino acids and used to retrieve additional DNAs 
encoding a thermostable polymerase. Because there may not be a 
precisely exact match between the nucleotide sequence in the Thermus 
aquaticus form and that in the corresponding portion of other species, 
oligomers containing approximately 12-18 nucleotides (encoding the 
four to six amino acid stretch) are probably necessary to obtain 
hybridization under conditions of sufficient stringency to eliminate 
false positives. The sequences encoding six amino acids would supply 
information sufficient for such probes. 

!n general, terms, the production of a recombinant form of 
Taq polymerase typically involves the following: 

First, a DNA is obtained that encodes the mature (used here 
to include all muteins) enzyme or a fusion of the Taq polymerase to an 
additional sequence that does not destroy its activity or to an 
additional sequence cleavable under controlled conditions (such as 
treatment with peptidase) to give an active protein. If the sequence 
is uninterrupted by introns it is suitable for expression in any 
host. This sequence should be in an excisable and recoverable form. 

The excised or recovered coding sequence is then preferably 
placed in operable linkage with suitable control sequences in a 
replicable expression vector. The vector is used to transform a 
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suitable host and the transformed host cultured under favorable 
conditions to effect the production of the recombinant Taq 
polymerase. Optionally the Taq polymerase is isolated from the medium 
or from the cells; recovery and purification of the protein may not be 
5 necessary in some instances, where some impurities may be tolerated. 

Each of the foregoing steps can be done in a variety of 
ways. For example, the desired coding sequences may be obtained from 
genomic fragments and used directly in appropriate hosts. The 
constructions for expression vectors operable in a variety of hosts 
10 are made using appropriate replicons and control sequences, as set 
forth below. Suitable restriction sites can, if not normally 
available, be added to the ends of the coding sequence so as to 
provide an excisable gene to insert into these vectors. 

The control sequences, expression vectors, and 
15 transformation methods are dependent on the type of host cell used to 
express the gene. Generally, procaryotic, yeast, insect or mammalian 
cells are presently useful as hosts. Procaryotic hosts are in general 
the most efficient and convenient for the production of recombinant 
proteins and therefore preferred for the expression of Taq polymerase. 

20 In the particular case of Taq polymerase, evidence indicates 

that considerable deletion at the N-terminus of the protein may occur 
under both recombinant and native conditions, and that the DNA 
polymerase activity qt the protein is still retained. It appears that 
the native proteins previously isolated may be the result of 

25 proteolytic degradation, and not translation of a truncated gene. The 
mutein produced from the truncated gene of plasmid pFC85 is, however, 
fully active in assays for DNA polymerase, as is that produced from 
DNA encoding the full-length sequence. Since it is clear that certain 
N-terminal shortened forms of the polymerase are active, the gene 

30 constructs used for expression of these polymerases may also include 
the corresponding shortened forms of the coding sequence. 



WO 89/06691 



PCT/US89/00127 



15 



10 



15 



20 



23 



30 



Procaryotes most frequently are represented by various 
strains of coin However, other microbial strains may also be 
used, such as bacilli, for example. Bacillus subtilis . various species 
of Pseudomonas, or other bacterial strains. In such procaryotic 
systems, plasmid vectors that contain replication sites and control 
sequences derived from a species compatible with the host are used. 
For example, E. coli is typically transformed using derivatives of 
PBR322, a plasmid derived from an E. coli species by Bolivar, et al., 
Gene (1977) 2:95. pBR322 contains genes for ampicillin and tetra- 
cycline resistance, and thus provides additional markers that can be 
either retained or destroyed in constructing the desired vector. 
Commonly used procaryotic control sequences, which are defined herein 
to include promoters for transcription initiation, optionally with an 
operator, along with ribosome binding site sequences, include such 
commonly used promoters as the beta-lactaraase (penicillinase) and 
lactose (lac) promoter systems (Chang, et al., Nature (1977) 
198:1056), the tryptophan (trp) promoter system (Goeddel , et al., 
Nucleic Acids Res. (1980) 8:4057) and the lambda-derived P L promoter 
(Shimatake, et al., Nature (1981) 292_:128) and N-gene ribosome binding 
site, which has been made useful as a portable control cassette (as 
set forth in U.S. Patent No. 4,711,845, issued December 8, 1987), 
which comprises a first DNA sequence that is the P L promoter operably 
linked to a second DNA sequel corresponding to N RBS upstream of a 
third DNA sequence having at least one restriction site that permits 
cleavage within six bp 3' of the N RBS sequence. Also useful is the 
phosphatase A (phoA) system described by Chang, et al. in European 
Patent Publication No. 196,864 published October 8, 1986, assigned to 
the same assignee and incorporated herein by reference. However, any 
available promoter system compatible with procaryotes can be used. 

In addition to bacteria, eucaryotic microbes, such as yeast, 
may also be used as hosts. Laboratory strains of Saccharomvces 
cerevisiae . Baker's yeast, are most used, although a number of other 
strains are commonly available. While vectors employing the 2 micron 
origin of replication are illustrated (Broach, J. R., Meth. Enz. 
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(1983) 101:307), other plasroid vectors suitable for yeast expression 
are known (see, for example, Stinchcomb, et al., Nature (1979) 282 :39, 
Tschempe, et a1 M Gene (1980) Hhl57 and Clarke, L., et al., Meth. 
Enz. (1983) 101:300). Control sequences for yeast vectors include 
5 promoters for the synthesis of glycolytic enzymes (Hess, et al., 
Adv. Enzyme Reg. (1968) 7_:149; Holland, et al. t Biotechnology (1978) 
17_:4900). 

Additional promoters known in the art include the promoter 
for 3-phosphoglycerate kinase (Hitzeman, et al., J. Biol. Chem. (1980) 
255:2073), and those for other glycolytic enzymes, such as 

10 glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvate 
decar boxyl ase , phos phof ructoki nase , gl ucose-6-phos phate i somerase , 3- 
phosphoglycerate mutase, pyruvate kinase, triosephos phate isomerase, 
phosphoglucose isomerase, and glucokinase. Other promoters that have 
the additional advantage of transcription controlled by growth 

15 conditions are the promoter regions for alcohol dehydrogenase 2, 
isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, and enzymes responsible for maltose and galactose 
ultilization (Holland, supra ). 

It is also believed that terminator sequences are desirable 
20 at the 3' end of the coding sequences. Such terminators are found in 
the 3* untranslated region following the coding sequences in yeast- 
derived genes. Many of the vectors illustrated contain control 
sequences derived from the enolase gene containing plasmid peno46 
(Holland, M. J., et al., J. Biol. Chem. (1981) 25^:1385) or the LEU2 
25 gene obtained from YEpl3 (Broach, J., et al., Gene (1978) £:121); 
however, any vector containing a yeast-compatible promoter, origin of 
replication, and other control sequences is suitable. 

It is also, of course, possible to express genes encoding 
polypeptides in eucaryotic host cell cultures derived from 
30 multicellular organisms. See, for example. Tissue Culture , Academic 
Press, Cruz and Patterson, editors (1973). Useful host cell lines 
include murine myelomas N51, VERO and HeLa cells, and Chinese hamster 
ovary (CHO) cells. Expression vectors for such cells ordinarily 
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include promoters and control sequences compatible with mammalian 
cells such as. for example, the commonly used early and late promoters 
from Simian Virus 40 (SV 40) (Fiers, et al.. Nature (1978 ) 271:113). 
or other viral promoters such as those derived from polyoma! 
Adenovirus 2, bovine papiloma virus, or avian sarcoma viruses, or 
immunoglobulin promoters and heat shock promoters. A system for 
expressing DNA in mammalian systems using the BPV as a vector is 
disclosed in U.S. Patent 4.419,446. A modification of this system is 
described in U.S. Patent 4.601.978. General aspects of mammalian cell 
host system transformations have been described by Axel. U.S. Patent 
No. 4.399,216. It now appears, also, that "enhancer" regions are 
important in optimizing expression; these are, generally, sequences 
found upstream of the promoter region. Origins of repl ication may -be 
obtained, if needed, from viral sources. However, integration into 
13 the chromosome is a common mechanism for DNA replication in 
eucaryotes. 

Plant cells are also now available as hosts, and control 
sequences compatible with plant cells such as the nopaline synthase 
promoter and polyadenylation signal sequences (Depicker. A.', et al., 
20 J. Hoi. Add!. Gen. MQft?) i_ :561 ) are available . 

Recently, in addition, expression systems employing insect 
cells utilizing the control systems provided by baculovirus vectors 
have been described ^ (Mil ler, D. W., et al.. in Ge.netic Enqineerino 
(1986) Setlow, J. K. et al., eds., Plenum Publishing, Vol. 8, pp. 277- 
25 297). These systems are also successful in producing Taq polymerase. 

Depending on the host cell used, transformation is done 
using standard techniques appropriate to such cells. The calcium 
treatment employing calcium chloride, as described by Cohen. S. N.. 
Proc. Natl. Acad. Sci. fUSA) (1972) 69:2110 is used for procaryotes or 
other cells that contain substantial cell wall barriers. Infection 
with ^bacterium tumefaciens (Shaw. C. H., et al.. Gene (1983) 
2J.:315) is used for certain plant cells. For mammalian cells without 
such cell walls, the calcium phosphate precipitation method of Graham 
and van der Eb. Virology (1978) 52:546 is preferred. Transformations 
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5 into yeast are carried out according to the method of Van Solingen, 
P., et al., J. Bact. (1977) 130_:946 and Hsiao, C. I., et al., Proc. 
Natl. Acad. Sci . (USA) (1979) 76:3829. 

The strategy for isolating DNA encoding desired proteins, 
such as the Taq polymerase encoding DNA, using the bacteriophage 

10 vector lambda gtll, is as follows. A library can be constructed of 
EcoRI- flanked Alul fragments, generated by complete digestion of 
Thermus aquaticus DNA, inserted at the EcoR I site in the lambda gtll 
phage (Young and Davis, Proc. Natl. Acad. Sci USA (1983) M_:1194- 
1198). Because the unique EcoR I site in this bacteriophage is located 

15 in the carboxy- terminus of the -galactosidase gene, inserted DNA (in 
the appropriate frame and orientation) is expressed as protein fused 
with -galactosidase under the control of the lactose operon 
promoter /operator. 

Genomic expression libraries are then screened using the 
20 antibody plaque hybridization procedure. A modification of this 
procedure, referred to as "epitope selection," uses antiserum against 
the fusion protein sequence encoded by the phage, to confirm the 
identification of hybridized plaques. Thus, this library of 
recombinant phages could be screened with antibodies that recognize 
23 the 86,000-95,000 dalton Taq polymerase in order to identify phage 
that carry DNA segments encoding the antigenic determinants of this 
protein. 

Approximately 2 x 10 5 recombinant phage are screened using 
total rabbit Taq polymerase antiserum. In this primary screen, 

30 positive signals are detected and one or more of these phages are 
purified from candidate plaques which failed to react with preimmune 
serum and reacted with immune serum and analyzed in some detail. To 
examine the fusion proteins produced by the recombinant phage, 
lysogens of the phage in the host Y1089 are produced. Upon induction 

35 of the lysogens and gel electrophoresis of the resulting proteins, 
each lysogen may be observed to produce a new protein, not found in 
the other lysogens, or duplicate sequences may result. Phage 
containing positive signals are picked; in this case, one positive 
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plaque was picked for further identification and replated at lower 
densities to purify recombinants and the purified clones were analyzed 
by size class via digestion with EcoR I restriction enzyme. Probes can 
then be made of the isolated DNA insert sequences and labeled 
=> appropriately and these probes can be used in conventional colony or 
plaque hybridization assays described in Maniatis et al. t Molecular 
Cloning: A Laboratory Manual (1982), the disclosure of which is 
incorporated herein by reference. 

The labeled probe was used to probe a second genomic library 
10 constructed in a Charon 35 bacteriophage (Wilhelmine, A. M. et al., 
Gene (1983) 26:171-179). This library was made from Sau3A partial 
digestions of genomic Thermus aquaticus DNA and size fractionated 
fragments (15-20 kb) were cloned into the BamHI site of the Charon 35 
phage. The probe was used to isolate phage containing DNA encoding 
15 the Taq polymerase. One of the resulting phage, designated 
CH35 :Taql4-2, was found to contain the entire gene sequence. Partial 
sequences encoding portions of the gene were also isolated. 

Construction of suitable vectors containing- the desired - 
coding and control sequences employs standard ligation and restriction 
20 techniques that are well understood in the art. Isolated plasmids, 
DNA sequences, or synthesized oligonucleotides are cleaved, tailored, 
and religated in the form desired. 

Site-specific DNA cleavage is performed by treating with the 
suitable restriction enzyme (or enzymes) under conditions that are 

23 generally understood in the art, and the particulars of which are 
specified by the manufacturer of these conroercially available 
restriction enzymes. See, e.g., New England Biolabs, Product 
Catalog. In general, about l^g of plasraid or DNA sequence is cleaved 
by one unit of enzyme in about 20^1 of buffer solution; in the 

30 examples herein, typically an excess of restriction enzyme is used to 
ensure complete digestion of the DNA substrate. Incubation times of 
about one -hour to two hours at about 37 °C are workable, although 
variations can be tolerated. After each incubation, protein is 
removed by extraction with phenol/chloroform, and may be followed by 
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5 ether extraction, and the nucleic acid recovered from aqueous 
fractions by precipitation with ethanol. If desired, size separation 
of the cleaved fragnents may be performed by polyacryl amide gel or 
agarose gel electrophoresis using standard techniques. A general 
description of size separations is found in Methods in Enzymoloqy 

10 (1980) 65*99-560. 

Restriction-cleaved fragnents may be blunt-ended by treating 
with the large fragment of E. coli DNA polymerase I (Klenow) in the 
presence of the four deoxynucleotide triphosphates (dNTPs) using 
incubation times of about 15 to 25 minutes at 20 to 25* C in 50 mM Tris 

15 pH 7.6, 50 mM NaCl, 10 mM MgCl 2 » 10 mM DTT and 50-100 /iM dNTPs. The 
Klenow fragment fills in at 5' sticky ends, but chews back protruding 
3' single strands, even though the four dNTPs are present. If 
desired, selective repair can be performed by supplying only one of 
the, or selected, dNTPs within the limitations dictated by the nature 

20 of the sticky ends. After treatment with Klenow, the mixture is 
extracted with phenol /chloroform and ethanol precipitated. Treatment 
under appropriate conditions with SI nuclease results in hydrolysis of 
any single-stranded portion.. 

Synthetic oligonucleotides may be prepared using the 
25 triester method of Matteucci, et al., ( J. Am. Chem. Soc. (1981) 
103 :3185-3I91) or using automated synthesis methods. Kinasing of 
single strands prior to annealing or for labeling is achieved using an 
excess, e.g., approximately 10 units of polynucleotide kinase to 1 nM 
substrate in the presence of 50 mM Tris, pH 7.6, 10 mM MgC^i 5 mM 
30 dithiothreitol f 1-2 mM ATP. If kinasing is for labeling of probe, the 
ATP will contain high specific activity gamma- P. 

Ligations are performed in 15-30 volumes under the 
following standard conditions and temperatures: 20 mM Tris-Cl pH 7.5, 
10 mM MgCl 2 . 10 mM DTT, 33 ^g/ml BSA, 10 mM-50 mM NaCl, and either 
35 40 pJA ATP, 0.01-0.02 (Weiss) units T4 DNA ligase at 0 C C (for "sticky 
end" ligation) or 1 mM ATP, 0.3-0.6 (Weiss) units T4 DNA ligase at 
14°C (for "blunt end 11 ligation). Intermolecular "sticky end" 
ligations are usually performed at 33-100 /iq/m\ total DNA 
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concentrations (5-100 nM total end concentration). Intermodular 
blunt end ligations (usually employing a 10-30 fold molar excess of 
linkers) are performed at l^M total ends concentration. 

In vector construction employing "vector fragments", the 
vector fragment is commonly treated with bacterial alkaline 
phosphatase (BAP) in order to remove the 5* phosphate and prevent 
religation of the vector. BAP digestions are conducted at pH 8 in 
approximately 150 iffi Tris, in the presence of Na + and Hg +2 using about 
1 unit of BAP per mg of vector at 60° C for about one hour. In order 
to recover the nucleic acid fragments, the preparation is extracted 
with phenol /chloroform and ethanol precipitated. Alternatively, 
religation can be prevented in vectors that have been double digested 
by additional restriction enzyme digestion of the unwanted fragments. 

For portions of vectors derived from cONA or genomic ONA 
15 that require sequence modifications, site-specific primer-directed 
mutagenesis is used. This technique is now standard in the art, and 
is conducted using a synthetic oligonucleotide primer complementary to 
a single-stranded phage DNA to be mutagenized except for limited 
mismatching, representing the desired mutation. Briefly, the 
20 synthetic oligonucleotide is used as a pr-hner to direct synthesis of a 
strand complementary to the phage, and the resulting doubl e- stranded 
DNA is transformed into a phage-supporting host bacterium. Cultures 
of the transformed bacteria are plated in top agar, permitting plaque 
formation from single cells that harbor the phage. 

23 Theoretically, 50X of the new plaques will contain the phage 

having, as a single strand, the mutated form; 50X will have the 
original sequence. The plaques are transferred to nitrocellulose 
filters and the •li-fts" hybridized with kinased synthetic primer at a 
temperature that permits hybridization of an exact match, but at which 
the mismatches with the original strand are sufficient to prevent 
hybridization. Plaques that hybridize with the probe are then picked 
and cultured, and the ONA is recovered. 

In the constructions set forth below, correct ligations for 
plasraid construction are confirmed by first transforming coli 
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strain MM294, or other suitable host, with the ligation mixture. 
Successful transforroants are selected by ampicillin, tetracycline or 
other antibiotic resistance or using other markers, depending on the 
mode of plasmid construction, as is understood in the art. Plasmids 
5 from the transformants are then prepared according to the method of 
Clewell, D.B., et al. t Proc. Natl. Acad. Sci. (USA) (1969) 6^:1159, 
optionally following chloramphenicol amplification (Clewell, D.B., J. 
Bacteriol. (1972) 110:667). The isolated DNA is analyzed by 
restriction and/or sequenced by the dideoxy method of Sanger, F., et 
10 al., Proc. Natl. Acad. Sci. (USA) (1977) 7£:5463 as further described 
by Messing, et al . t Nucleic Acids Res. (1981) 9^309, or b y the method 
of Maxam, et al. f Methods in Enzymoloqy (1980) 65 :499. 

Host strains used in cloning and expression herein are as 

follows : 

15 For cloning and sequencing, and for expression of 

constructions under control of most bacterial promoters, E^ col i 
strain MM294 obtained from col i Genetic Stock Center GCSC #6135, 
was used as the host. For expression under control of the P|_Nrbs 
promoter, E. coli strain K12 MC1000 lambda lysogen, N 7 N 53 cI857 SusP g0 , 

20 ATCC 39531 may be used. Used herein are E^ col i DG116, which was 
deposited with ATCC (ATCC 53606) on April 7, 1987 and E. coli KB2 , 
which was deposited with ATCC (ATCC 53075) on March 29, 1985. 

For M13 ph^ge recombinants, E± col i strains susceptible to 

phage infection, such as E. coli K12 strain DG98, are employed. The 

25 DG98 strain has been deposited with ATCC July 13, 1984 and has 
accession number 39768. 

Mammalian expression can be accomplished in COS-7 C0S-A2 , 
CV-1, and murine cells, and insect cell-based expression in Spodoptera 
frugi peida) . 

30 in addition to the purification procedures previously 

described, the thermostable polymerase of the invention may be 
purified using hydrophobic interaction chromatography. Hydrophobic 
interaction chromatography is a separation technique in which 
substances are separated on the basis of differing strengths of 
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hydrophobic interaction with an uncharged bed material containing 
hydrophobic groups. Typically, the column is first equilibrated under 
conditions favorable to hydrophobic binding, e.g., high ionic 
strength. A descending salt gradient may be used to elute the sample. 

According to the invention, the aqueous mixture (containing 
5 either native or recombinant polymerase) is loaded onto a column 
containing a relatively strong hydrophobic gel such as Phenyl 
Sepharose (manufactured by Pharmacia) or Phenyl TSK (manufactured by 
Toyo Soda). To promote hydrophobic interaction with a Phenyl 
Sepharose column, a solvent is used which contains, for example, 
greater than or equal to 0.2 M ammonium sulfate, with 0.2 M being 

10 preferred. Thus the column and the sample are adjusted to 0.2 M 
ammonium sulfate in 50 mM Tris-lmM EDTA buffer and the sample applied 
to the column. The column is washed with the 0.2 M ammonium sulfate 
buffer. The enzyme may then be eluted with solvents which attenuate 
hydrophobic interactions such as, for example, decreasing salt 

15 gradients, ethylene or propylene glycol, or urea. For the recombinant 
Taq polymerase, a preferred embodiment involves washing the column 
sequentially with the Tris-EDTA buffer and the Tris-EDTA buffer 
containing 20X ethylene glycol. The Taq polymerase is subsequently 
eluted from the column with a 0-4 M urea gradient in the Tris-EDTA 

20 ethylene glycol buffer. 

For long-term stability, the enzyme herein must be stored in 
a buffer that contains one or more non-ionic polymeric detergents. 
Such detergents are generally those that have a molecular weight in 
the range of approximately 100 to 250,000, preferably about 4,000 to 

25 200,000 daltons and stabilize the enzyme at a pH of from about 3.5 to 
about 9.5, preferably from about 4 to 8.5. Examples of such 
detergents include those specified on pages 295-298 of McCutcheon's 
Emulsifiers & Detergents . North American edition (1983), published by 
the McCutcheon Division of MC Publishing Co., 175 Rock Road, Glen 

30 Rock, NJ (USA), the entire disclosure of which is incorporated herein 
by reference. Preferably, the detergents are selected from the group 
comprising ethoxylated fatty alcohol ethers and lauryl ethers, 
ethoxylated alkyl phenols, octylphenoxy polyethoxy ethanol compounds, 
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modified oxyethylated and/or oxypropylated straight-chain alcohols, 
polyethylene glycol monooleate compounds, polysorbate compounds, and 
phenolic fatty alcohol ethers. More particularly preferred are Tween 
20, from ICI Anericas Inc., Wilmington, DE, which is a 
5 polyoxyethylated (20) sorbitan raonolaurate, and Iconol NP-40, from 
BASF Wyandotte Corp. Parsippany, NJ, which is an ethoxylated alkyl 
phenol (nonyl). 

The thermostable enzyme of this invention may be used for 
any purpose in which such enzyme is necessary or desirable. In a 
10 particularly preferred embodiment, the enzyme herein is employed in 
the amplification protocol set forth below. 

The amplification protocol using the enzyme of the present 
invention may be the process for amplifying existing nucleic acid 
sequences that is disclosed and claimed in U.S. Patent No, 4,683,202, 
15 issued July 28, 1987, the disclosure of which is incorporated herein 
by reference. Preferably, however, the enzyme herein is used in the 
amplification process disclosed below. 

Specifically, the * ampl if ication method involves amplifying 
at least one specific nucleic acid sequence contained in a nucleic 
20 acid or a mixture of nucleic acids, wherein if the nucleic acid is 
double-stranded, it consists of two separated complementary strands of 
equal or unequal length, which process comprises: 

(a) contacting each nucleic acid strand with four different 
nucleotide triphosphates and one oligonucleotide primer for each 

25 different specific sequence being amplified, wherein each primer is 
selected to be substantially complementary to different strands of 
each specific sequence, such that the extension product synthesized 
from one primer, when it is separated from its complement, can serve 
as a template for synthesis of the extension product of the other 

30 primer, said contacting being at a temperature which promotes 
hybridization of each primer to its complementary nucleic acid strand; 

(b) contacting each nucleic acid strand, at the same time 
as or after step (a), with a DNA polymerase from Thermus aquaticus 
which enables combination of the nucleotide triphosphates to form 
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primer extension products complementary to each strand of each nucleic 
acid; 

(c) maintaining the mixture from step (b) at an effective 
temperature for an effective time to promote the activity of the 
enzyme, and to synthesize, for each different sequence being 
amplified, an extension product of each primer which is complementary 
to each nucleic acid strand template, but not so high as to separate 
each extension product from its complementary strand template; 

(d) heating the mixture from step (c) for an effective time 
and at an effective temperature to separate the primer extension 
products from the templates on which they were synthesized to produce 
single-stranded molecules, but not so high as to denature irreversibly 
the enzyme; 

(e) cooling the mixture from step (d) for an effective time 
and to an effective temperature to promote hybridization of each 

15 primer to each of the si ngl e- stranded molecules produced in step (d); 
and 

(f) maintaining the mixture from step (e) at an effective 
temperature for an effective time to promote the activity of the 
enzyme and to synthesize, for each different sequence being amplified, 

20 an extension product of each primer which is complementary to each 
nucleic acid strand template produced in step (d), but not so high as 
to separate each extension product from its complementary strand 
template wherein the effective time and temperatures in steps (e) and 
(f) may coincide (steps (e) and (f) are carried out simultaneously), 

25 or may be separate. 

Steps (d)-(f) may be repeated until the desired level of 
sequence amplification is obtained. 

The amplification method is useful not only for producing 
large amounts of an existing completely specified nucleic acid 
30 sequence, but also for producing nucleic acid sequences which are 
known to exist but are not completely specified. In either case an 
initial copy of the sequence to be amplified must be available, 
although it need not be pure or a discrete molecule. 
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In general, the amplification process involves a chain 
reaction for producing, in exponential quantities relative to the 
number of reaction steps involved, at least one specific nucleic acid 
sequence given (a) that the ends of the required sequence are known in 
5 sufficient detail that oligonucleotides can be synthesized which will 
hybridize to them, and (b) that a small amount of the sequence is 
available to initiate the chain reaction. The product of the chain 
reaction will be a discrete nucleic acid duplex with termini 
corresponding to the ends of the specific primers employed. 

10 Any nucleic acid sequence, in purified or nonpurified form, 

can be utilized as the starting nucleic acid(s), provided it contains 
or is suspected to contain the specific nucleic acid sequence 
desired. Thus, the process may employ, for example, DNA or RNA, 
including messenger RNA, which DNA or RNA may be single-stranded or 

15 double-stranded. In addition, a DNA-RNA hybrid which contains one 
strand of each may be utilized. A mixture of any of these nucleic 
acids may also be employed, or the nucleic acids produced from a 
previous amplification reaction herein using the same or different 
primers may be so utilized." The specific nucleic acid sequence to be 

20 amplified may be only a fraction of a larger molecule or can be 
present initially as a discrete molecule, so that the specific 
sequence constitutes the entire nucleic acid. 

It is not necessary that the sequence to be amplified be 
■ 

present initially in a pure form; it may be a minor fraction of a 
25 complex mixture, such as a portion of the beta-globin gene contained 
in whole human DNA (as exemplified in Saiki et al.. Science , 230, 
1530-1534 (1985)) or a portion of a nucleic acid sequence due to a 
particular microorganism which organism might constitute only a very 
minor fraction of a particular biological sample. The starting 
30 nucleic acid sequence may contain more than one desired specific 
nucleic acid sequence which may be the same or different. Therefore, 
the amplification process is useful not only for producing large 
amounts of one specific nucleic acid sequence, but also for amplifying 
simultaneously more than one different specific nucleic acid sequence 
35 located on the same or different nucleic acid molecules. 



WO 89/06691 



PCT/US89/00127 



27 

The nucleic acid(s) may be obtained from any source, for 
example, from plasmids such as pBR322, from cloned DNA or RNA, or from 
natural DNA or RNA from any source, including bacteria, yeast, 
viruses, organelles, and higher organisms such as plants or animals. 
5 DNA or RNA may be extracted from blood, tissue material such as 
chorionic villi, or amniotic cells by a variety of techniques such as 
that described by Maniatis et al., supra , p. 280-281. 

If probes are used which are specific to a sequence being 
amplified and thereafter detected, the cells may be directly used 
10 without extraction of the nucleic acid if they are suspended in 
hypotonic buffer and heated to about 90-100°C, until cell lysis and 
dispersion of intracellular components occur, generally 1 to 15 
minutes. After the heating step the amplification reagents may be 
added directly to the lysed cells. 

10 Any specific nucleic acid sequence can be produced by the 

amplification process. It is only necessary that a sufficient number 
of bases at both ends of the sequence be known in sufficient detail so 
that two oligonucleotide primers can be prepared which will hybridize 
to different strands of the desired sequence and at relative positions 

20 along the sequence such that an extension product synthesized from one 
primer, when it is separated from its template (complement), can serve 
as a template for extension of the other primer into a nucleic acid 
sequence of defined length. The greater the knowledge about the bases 
at both ends of the sequence, the greater can be the specificity of 

25 the primers for the target nucleic acid sequence, and thus the greater 
the efficiency of the process. 

It will be understood that the word "primer" as used 
hereinafter may refer to more than one primer, particularly in the 
case where there is some ambiguity in the information regarding the 
30 terminal sequence (s) of the fragment to be amplified. For instance, 
in the case where a nucleic acid sequence is inferred from protein 
sequence information, a collection of primers containing sequences 
representing all possible codon variations based on degeneracy of the 
genetic code will be used for each strand. One primer from this 
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collection will be homologous with the end of the desired sequence to 
be amplified. 

The oligonucleotide primers may be prepared using any 
suitable method, such as, for example, the phosphotri ester and 

5 phosphodi ester methods described above, or automated embodiments 
thereof. In one such automated embodiment, di ethyl phosphoramidi tes 
are used as starting materials and may be synthesized as described by 
Beaucage et al., Tetrahedron Letters (1981), 22_:1859-1862. One method 
for synthesizing oligonucleotides on a modified solid support is 

10 described in U.S. Patent No. 4,458,066. It is also possible to use a 
primer which has been isolated from a biological source (such as a 
restriction endonuclease digest). 

The specific nucleic acid sequence is produced by using the 
nucleic acid containing that sequence as a template. The first step 
15 involves contacting each nucleic acid strand with four different 
nucleotide triphosphates and one oligonucleotide primer for each 
different nucleic acid sequence being amplified or detected. If the 
nucleic acids to be amplified or detected are ONA, then the nucleotide 
triphosphates are dATP, dCTP, dGTP and TTP. 

20 The nucleic acid strands are used as a template for the 

synthesis of additional nucleic acid strands. This synthesis can be 
performed using any suitable method. Generally it occurs in a 
buffered aqueous solution, preferably at a pH of 7-9, most preferably 
about 8. Preferably, a molar excess (for cloned nucleic acid, usually 

25 about 1000:1 primer: tempi ate, and for genomic nucleic acid, usually 
about 10^:1 primer: tempi ate) of the two oligonucleotide primers is 
added to the buffer containing the separated template strands. It is 
understood, however, that the amount of complementary strand may not 
be known if the process herein is used for diagnostic applications, so 

30 that the amount of primer relative to the amount of complementary 
strand cannot be determined with certainty. As a practical matter, 
however, the amount of primer added will generally be in molar excess 
over the amount of complementary strand (template) when the sequence 
to be amplified is contained in a mixture of complicated long-chain 
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nucleic acid strands. A large molar excess is preferred to improve 
the efficiency of the process- 
Preferably the concentration of nucleotide triphosphates is 
150-200 each in the buffer for amplification and MgCl 2 is present 
5 in the buffer in an amount of 1.5-2 mM to increase the efficiency and 
specificity of the reaction. 

The resulting solution is then treated according to whether 
the nucleic acids being amplified or detected are double or single- 
stranded. If the nucleic acids are single-stranded, then no 

10 denaturation step need be employed, and the reaction mixture is held 
at a temperature which promotes hybridization of the primer to its 
complementary target (template) sequence. Such temperature is 
generally from about 35* C to 65° C or more, preferably about 37-60 e C 
for an effective time, generally one-half to five minutes, preferably 

13 one-three minutes. Preferably, 45-58 p C is used for Taq polymerase and 
15-mer primers to increase the specificity of primer hybridization. 
Shorter primers need lower temperatures. 

The complement to the original single-stranded nucleic acid 
may be synthesized by adding one or two oligonucleotide primers 

20 thereto. If an appropriate single primer is added, a primer extension 
product is synthesized in the presence of the primer, the DNA 
polymerase from Thermus aquaticus and the nucleotide triphosphates. 
The product will be partially complementary to the single-stranded 
nucleic acid and wilt hybridize with the nucleic acid strand to form a 

25 duplex of strands of unequal length which may then be separated into 
single strands as described above to produce two single separated 
complementary strands. Alternatively, two appropriate primers may be 
added to the sing! e- stranded nucleic acid and the reaction carried 
out. 

30 If the nucleic acid contains two strands, it is necessary to 

separate the strands of the nucleic acid before it can be used as the 
template. This strand separation can be accomplished by any suitable 
denaturing method including physical, chemical or enzymatic means. 
One preferred physical method of separating the strands of the nucleic 
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acid involves heating the nucleic acid until it is completely ( >99%) 
denatured. Typical^ heat denaturation involves temperatures ranging 
from about 90 to 105 V C for times generally ranging from about O.S to 5 
minutes. Preferably the effective denaturing temperature is 90-100* C 
5 for 0.5 to 3 minutes. Strand separation may also be induced by an 
enzyme from the class of enzymes known as heli cases or the enzyme 
RecA, which has helicase activity and in the presence of riboATP is 
known to denature DNA. The reaction conditions suitable for 
separating the strands of nucleic acids with helicases are described 
10 by Kuhn Hoffmann-Berling, CSH-Quantitati ve Biology . 43_:63 (1978), and 
techniques for using RecA are reviewed in C. Radding, Ann . Rev . 
Genetics » 16^:405-37 (1982). The denaturation produces two separated 
complementary strands of equal or unequal length. 

If the double-stranded nucleic acid is denatured by heat, 
15 the reaction mixture is allowed to cool to a temperature which 
promotes hybridization of each primer present to its complementary 
target (template) sequence. This temperature is usually from about 
35 C* to 65 C C or more, depending on reagents, preferably 37-60*0, 
maintained for an effective time, generally 0.5 to 5 minutes, and 
20 preferably 1-3 minutes. In* practical terms, the temperature is simply 
lowered from about 95°C to as low as 37 ^C, preferably to about 45-58* C 
for Taq polymerase, and hybridization occurs at a temperature within 
this range. 

Whether the nucleic acid is single- or double-stranded, the 
25 DNA polymerase from Thermus aquaticus may be added at the denaturation 
step or when the temperature is being reduced to or is in the range 
for promoting hybridization. The reaction mixture is then heated to a 
temperature at which the activity of the enzyme is promoted or 
optimized, i.e., a temperature sufficient to increase the activity of 
30 the enzyme in facilitating synthesis of the primer extension products 
from the hybridized primer and template. The temperature must 
actually be sufficient to synthesize an extension product of each 
primer whicJi is complementary to each nucleic acid template, but must 
not be so high as to denature each extension product from its 
35 complementary template (i.e., the temperature is generally less than 
about 80 C C-90* ? C). 
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Depending mainly on the types of enzyme and nucleic acid(s) 
employed, the typical temperature effective for this synthesis 
reaction generally ranges from about 40 to 80°C, preferably 50-75°C. 
The temperature more preferably ranges from about 65-75°C when a DNA 
polymerase from Thermus aquaticus is employed. The period of time 
5 required for this synthesis may range from about 0.5 to 40 minutes or 
more, depending mainly on the temperature, the length of the nucleic 
acid, the enzyme and the complexity of the nucleic acid mixture, 
preferably one to three minutes. If the nucleic acid is longer, a 
longer time period is generally required. The presence of 
10 dimethyl sulfoxide (DHSO) is not necessary or recommended because DMSO 
was found to inhibit Taq polymerase enzyme activity. 

The newly synthesized strand and its complementary nucleic 
acid strand fonm a double-stranded molecule which is used in the 
succeeding steps of the process. In the next step, the strands of the 

13 double-stranded molecule are separated by heat denaturation at a 
temperature effective to denature the molecule, but not so high that 
the thermostable enzyme is completely and irreversibly denatured or 
inactivated. Depending mainly on the type of enzyme and the length of 
nucleic acid, this temperature generally ranges from about 90 to 105 

20 C, more preferably 90-100 C, and the time for denaturation typically 
ranges from 0.5 to four minutes, depending mainly on the temperature 
and nucleic acid length. 

After this time, the temperature is decreased to a level 
which promotes hybridization of the primer to its complementary 
25 single- stranded molecule (template) produced from the previous step. 
Such temperature is described above. 

After this hybridization step, or in lieu of (or 
concurrently with) the hybridization step, the temperature is adjusted 
to a temperature that is effective to promote the activity of the 
30 thermostable enzyme to enable synthesis of a primer extension product 
using as template the newly synthesized strand from the previous 
step. The temperature again must not be so high as to separate 
(denature) the extension product from its template, as previously 
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described (usually from 40 to 80 C C for 0.5 to 40 minutes, preferably 
50 to 70* C for one-three minutes). Hybridization may occur during 
this step* so that the previous step of cooling after denaturation is 
not required. In such a case, using simultaneous steps, the preferred 

c 

5 temperature range is 50-70 C. 

The heating and cooling steps of strand separation, 
hybridization, and extension product synthesis can be repeated as 
often as needed to produce the desired quantity of the specific 
nucleic acid sequence, depending on the ultimate use. The only 

10 limitation is the amount of the primers, thermostable enzyme and 
nucleotide triphosphates present. Preferably, the steps are repeated 
at least twice. For use in detection, the number of cycles will 
depend, e.g., on the nature of the sample. For example, fewer cycles 
will be required if the sample being amplified is pure. If the sample 

15 is a complex mixture of nucleic acids, more cycles will be required to 
amplify the signal sufficiently for its detection. For general 
amplification and detection, preferably the process is repeated at 
least 20 times. 

When labeled sequence-specific probes are employed as 
20 described below, preferably the steps are repeated at least five 
times. When human genomic DNA is employed with such probes, the 
process is repeated preferably 15-30 times to amplify the sequence 
sufficiently that a. clearly detectable signal is produced, i.e., so 
that background noise does not interfere with detection. 

25 As will be described in further detail below, the amount of 

the specific nucleic acid sequence produced will accumulate in an 

exponent i al f ashi on . 

No additional nucleotides, primers, or thermostable enzyme 

need be added after the initial addition, provided that the enzyme has 
30 not become denatured or inactivated irreversibly, in which case it is 

necessary to replenish the enzyme after each denaturing step. 

Addition of such materials at each step, however, will not adversely 

affect the reaction. 
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When it is desired to produce more than one specific nucleic 
acid sequence from the first nucleic acid or mixture of nucleic acids, 
the appropriate number of different oligonucleotide primers are 
utilized. For example, if two different specific nucleic acid 
sequences are to be produced, four primers are utilized. Two of the 
primers are specific for one of the specific nucleic acid sequences 
and the other two primers are specific for the second specific nucleic 
acid sequence. In this manner, each of the two different specific 
sequences can be produced exponentially by the present process. 

After the appropriate length of time has passed to produce 
the desired amount of the specific nucleic acid sequence, the reaction 
may be halted by inactivating the enzyme in any known manner (e.g., by 
adding EOT A, phenol, SOS or CHCI3) or by separating the components of 
the reaction. 

lj The amplification process may be conducted continuously. In 

one embodiment of an automated process, the reaction mixture may be 
temperature cycled such that the temperature is programmed to be 
controlled at a certain level for a certain time. 



10 



20 



One such instrument for this purpose utilizes a liquid 
handling system under computer control to make liquid transfers of 
enzyme stored at a controlled temperature in a first receptacle into a 
second receptacle whose temperature is controlled by the computer to 
conform to a curtain incubation profile. The second receptacle stores 
the nucleic acid sequence(s) to be amplified plus the nucleotide 
25 triphosphates and primers. The computer includes a user interface 
through which a user can enter process parameters that control the 
characteristics of the various steps in the amplification sequence 
such as the times and temperatures of incubation, the amount of enzyme 
to transfer, etc. 

30 * preferred machine that may be employed utilizes 

temperature cycling without a liquid handling system because the 
enzyme need not be transferred at every cycle. Such a machine is 
described more completely in European Patent Publication No. 236,069, 
published September 9, 1987, the disclosure of which is incorporated 
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herein by reference. Briefly, this instrument consists of the 
following systems: 

1. A heat-conducting container for holding a given number 
of tubes, preferably 500^)1 tubes, which contain the reaction mixture 

5 of nucleotide triphosphates, primers, nucleic acid sequences, and 
enzyme. 

2. A means to heat, cool, and maintain the heat-conducting 
container above and below room temperature, which means has an input 
for receiving a control signal for controlling which of the 

10 temperatures at or to which the container is heated, cooled or 
maintained. (These may be Peltier heat pumps available from Materials 
Electronics Products Corporation in Trenton, NJ or a water heat 
exchanger. ) 

3. A computer means (e.g., a microprocessor controller), 
15 coupled to the input of said means, to generate the signals that 

control automatically the amplification sequence, the temperature 
levels, and the temperature ramping and timing. 

A representative ampl if ication protocol for double-stranded 
DNA containing the desired sequence [s] comprised of complementary 

20 strands £s + J and [Vj is as follows. Ouring the first and each 
subsequent reaction cycle, extension of each oligonucleotide primer on 
the original template will produce one new ssDNA molecule product of 
indefinite length that terminates with only one of the primers. These 
products, hereafter referred to as "long products," will accumulate in 

25 a linear fashion; that is, the amount present after any number of 
cycles will be proportional to the number of cycles. 

The long products thus produced will act as templates for 
one or the other of the oligonucleotide primers during subsequent 
cycles and will produce molecules of the desired sequence [Vj or 
30 £s"J. These molecules will also function as templates for one or the 
other of the oligonucleotide primers, producing further [s + j and [Vj, 
and thus a chain reaction can be sustained that will result in the 
accumulation of S at an exponential rate relative to the number of 
cycles. 
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3 By-products formed by oligonucleotide hybridizations other 

than those intended are not self-catalytic (except in rare instances) 
and thus accumulate at a linear rate. Each strand which terminates 
with the oligonucleotide sequence of one primer and the complementary 
sequence of the other is the specific nucleic acid sequence S that 
10 is desired to be produced. 

The amount of original nucleic acid remains constant in the 
entire process, because it is not replicated. The amount of the long 
products increases linearly because they are produced only from the 
original nucleic acid. The amount of the specific sequence increases 
^ exponentially. Thus, the specific sequence will become the 
predominant species. This is illustrated in the following table, 
which indicates the relative amounts of the species theoretically 
present after n cycles, assuming 1001 efficiency at each cycle: 



20 



Number of Double Strands 
After 0 to n Cycles 



30 



35 



Cycle Number 

0 

1 

2 

3 

5 
10 
15 
20 

n 



Tempi ate 
1 
1 
1 
1 
1 
1 
1 
1 
1 



Long 
Products 



1 
2 
3 
5 
10 
15 
20 
n 



Speci fic 
Sequence S 



0 
1 
4 
26 
1013 
32,752 
1,048,555 
(2 n -n-l) 



When a single-stranded nucleic acid is utilized as the template, only 
one long product is formed per cycle. 

A sequence within a given sequence can be amplified after a 
given number of amplifications to obtain greater specificity of the 
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5 reaction by adding after at least one cycle of amplification a set of 
primers that are complementary to internal sequences (that are not on 
the ends) of the sequence to be amplified. Such primers may be added 
at any stage and will provide a shorter amplified fragnent. 
Alternatively, a longer fragment can be prepared by using primers with 

10 non-complementary ends but having some overlap with the primers 
previously utilized in the amplification. 

The amplification method may be utilized to clone a 
particular nucleic acid sequence for insertion into a suitable 
expression vector. The vector may be used to transform an appropriate 
15 host organism to produce the gene product of the sequence by standard 
methods of recombinant DNA technology. Such cloning may involve 
direct ligation into a vector using blunt-end ligation, or use of 
restriction enzymes to cleave at sites contained within the primers. 

In addition, the amplification process can be used for vn_ 
20 vi tro mutagenesis. The oligodeoxyribonucleotide primers need not be 
exactly complementary to the DNA sequence that is being amplified. It 
is only necessary that they be able to hybridize to the sequence 
sufficiently well to be extended by the thermostable enz>me. The 
product of an amplification reaction wherein the primers employed are 
25 not exactly complementary to the original template will contain the 
sequence of the primer rather than the template, thereby introducing 
an vn^ vitro mutation. In further cycles this mutation will be 
amplified with an undiminished efficiency because no further mispaired 
priming is required. The mutant thus produced may be inserted into an 
30 appropriate vector by standard molecular biological techniques and 
might confer mutant properties on this vector such as the potential 
for production of an altered protein. 

The process of making an altered DNA sequence as described 
35 above could be repeated on the altered DNA using different primers to 
induce further sequence changes. In this way, a series of mutated 
sequences could gradually be produced wherein each new addition to the 
series could differ from the last in a minor way, but from the 
original DNA source sequence in an increasingly major way. In this 
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manner, chances could be made ultimately which were not feasible in a 
single step due to the inability of a very seriously mismatched primer 
to function. 

In addition, the primer can contain as part of its sequence 
5 a non-complementary sequence, provided that a sufficient amount of the 
primer contains a sequence that is complementary to the strand to be 
amplified. For example, a nucleotide sequence that is not 
complementary to the template sequence (such as, e.g., a promoter, 
linker, coding sequence, etc.) may be attached at the 5' end of one or 
10 both of the primers, and thereby appended to the product of the 
amplification process. After the extension primer is added, 
sufficient cycles are run to achieve the desired amount of new 
template containing the non-complementary nucleotide insert. This 
allows production of large quantities of the combined fragments in a 
15 relatively short period of time (e.g., two hours or less) using a 
simple technique. 

The amplification method may also be used to enable 
detection and/or characterization of specific nucleic acid sequences 
associated with infectious diseases, genetic disorders or cellular 
20 disorders such as cancer, e.g., oncogenes. Anplifi cation is useful 
when the amount of nucleic acid available for analysis is very small, 
as, for example, in the prenatal diagnosis of sickle cell anemia using 
DNA obtained from fetal cells. Amplification is particularly useful 
if such an analysis is to be done on a small sample using non- 
25 radioactive detection techniques which may be inherently insensitive, 
or where radioactive techniques are being employed, but where rapid 
detection is desirable. 

For the purposes of this invention, genetic diseases may 
include specific deletions and/or mutations in genomic DNA from any 
30 organism, such as t e.g., sickle cell anemia, cystic fibrosis, alpha- 
thalassemia, beta-thalasseraia, and the like. Sickle cell anemia can 
be readily detected via oligomer restriction analysis as described by 
EP Patent Publication 164,054 published December 11, 1985, or via a 
RFLP-like analysis following amplification of the appropriate DNA 
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sequence by the amplification method. Alpha-Thai assemi a can be 
detected by the absence of a sequence, and beta-thai assemia can be 
detected by the presence of a polymorphic restriction site closely 
linked to a mutation that causes the disease. 

5, All of these genetic diseases may be detected by amplifying 

the appropriate sequence and analyzing it by Southern blots without 
using radioactive probes. In such a process, for example, a small 
sample of DNA from, e.g., amniotic fluid containing a very low level 
of the desired sequence is amplified, cut with a restriction enzyme, 

10 and analyzed via a Southern blotting technique. The use of non- 
radioactive probes is facilitated by the high level of the amplified 
si gnal . 

In another embodiment, a small sample of ONA may be 
amplified to a convenient level and then a further cycle of extension 
15 reactions performed wherein nucleotide derivatives which are readily 
detectable (such as 32 P-labeled or biotin-labeled nucleotide 
triphosphates) are incorporated directly into the final DNA product, 
which may be analyzed by restriction and electrophoretic separation or 
any other appropriate method. 

20 In a further embodiment, the nucleic acid may be exposed to 

a particular restriction endonuclease prior to amplification. Since a 
sequence which has been cut cannot be amplified, the appearance of an 
amplified fragment,* despite prior restriction of the DNA sample, 
implies the absence of a site for the endonuclease within the 

25 amplified sequence. The presence or absence of an amplified sequence 
can be detected by an appropriate method. 

A practical application of the amplification technique, that 
is, in facilitating the detection of sickle cell anemia via the 
oligomer restriction technique described in EP 164,054, supra, and by 
30 Saiki et al., Bio/Technology , Vol. 3, pp. 1008-1012 (1985) is 
described in detail in the Saiki et al. Science article cited above. 
In that Science article, a specific amplification protocol is 
exemplified using a beta-globin gene segnent. 
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The amplification method herein may also be used to detect 
directly single-nucleotide variations in nucleic acid sequence (such 
as genomic ONA) using sequence-speci fic oligonucleotides, as described 
more fully in European Patent Publication 237,362, published September 
3 16, 1987, the disclosure of which is incorporated herein by reference. 

Briefly, in this process, the amplified sample is spotted 
directly on a series of membranes, and each membrane is hybridized 
with a different labeled sequence-specific oligonucleotide probe. 
After hybridization the sample is washed and the label is detected. 
10 This technique is especially useful in detecting ONA polymorphisms. 

Various infectious diseases can be diagnosed by the presence 
in clinical samples of specific DNA sequences characteristic of the 
causative microorganism. These include bacteria, such as Salmonella, 
Chlamydia, Neisseria; viruses, such as the hepatitis viruses, and 

la parasites, such as the Plasmodium responsible for malaria. U.S. 
Patent Reexamination Certificate Bl 4,358,535 issued to Falkow et al . 
on May 13, 1986 describes the use of specific DNA hybridization probes 
for the diagnosis of infectious diseases. A relatively small number 
of pathogenic organisms may be present in a clinical sample from an 

20 infected patient and the DNA extracted from these may constitute only 
a very small fraction of the total DNA in the sample. Specific 
amplification of suspected pathogen-specific sequences prior to 
immobilization and detection by hjLridization of the DNA samples could 
greatly improve the sensitivity and specificity of traditional 

-5 procedures. 

Routine clinical use of DNA probes for the diagnosis of 
infectious diseases would be simplified considerably if non- 
radioactive! y labeled probes could be employed as described in EP 
63,879 to Ward. In this procedure biotin-containing DNA probes are 
detected by chromogenic enzymes linked to avidin or biotin-speci fic 
antibodies. This type of detection is convenient, but relatively 
insensitive. The combination of specific DNA amplification by the 
present method and the use of stably labeled probes could provide the 
convenience and sensitivity required to make the Falkow et al . and 
35 Ward procedures useful in a routine clinical setting. 



30 
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A specific use of the amplification technology for detecting 
or monitoring for the AIDS virus is described in European Patent 
Publication 229,701, published July 22, 1987, the disclosure of which 
is incorporated herein by reference. Briefly, the amplification and 

5 detection process is used with primers and probes which are designed 
to amplify and detect, respectively, nucleic acid sequences that are 
substantially conserved among the nucleic acids in AIDS viruses and 
specific to the nucleic acids in AIDS viruses. Thus, the sequence to 
be detected must be sufficiently complementary to the nucleic acids in 

10 AIDS viruses to initiate pol>meri zation preferably at room temperature 
in the presence of the enzyme and nucleotide triphosphates. 

A preferred amplification process uses labeled primers. The 
label on the amplified product may be used to "capture" or immobilize 
the product for subsequent detection (e.g., biotinylated amplification 

15 primers yield labeled products that can be "captured" by their 
interaction with avidin or strepavidin). As demonstrated in the 
aforementioned amplification protocols, the extension product of one 
labeled primer when hybridized to the other becomes a template for the 
production of the desi red " speci f ic nucleic acid sequence, and vice 

20 versa, and the process is repeated as often as necessary to produce 
the desired amount of the sequence. Examples of specific preferred 
reagents that can be employed as the label are provided in U.S. Patent 
No. 4,582,789, the disclosure of which is incorporated herein by 
reference. 

25 The amplification process can also be utilized to produce 

sufficient quantities of DNA from a single copy human gene such that 
detection by a simple non-specific DNA stain such as ethidium bromide 
can be employed to diagnose DNA directly. 

In addition to detecting infectious diseases and 

30 pathological abnormalities in the genome of organisms, the 

amplification process can also be used to detect ONA polymorphisms 

which may not be associated with any pathological state. 

In summary, the amplification process is seen to provide a 
process for amplifying one or more specific nucleic acid sequences 
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using a chain reaction and a thermostable enzyme, in which reaction 
primer extension products are produced which can subsequently act as 
templates for further primer extension reactions. The process is 
especially useful in detecting nucleic acid sequences which are 
5 initially present in only very small amounts. 

The following examples are offered by way of illustration 
only and are by no means intended to limit the scope of the claimed 
invention. In these examples, all percentages are by weight if for 
solids and by volume if for liquids, unless otherwise noted, and all 
10 temperatures are given in degrees Celsius. 

EXAMPLE I 

A. Synthesis of the Primers 

The following two oligonucleotide primers were prepared by 
the method described below: 

15 5 1 -ACACAACTGTGTTCACTAGC-3 ' ( PC03) 

5 ' - CAACTTCATCCACGTTCACC-3 ' ( PC04) 

These primers, both 20-mers, anneal to opposite strands of the genomic 
DNA with their . 5 1 ends separated by a distance of 110 base pairs. 

1. Automated Synthesis Procedures: The 
20 diethyl phosphoramidi tes, synthesized according to Beaucage and 

Caruthers ( Tetrahedron Letters (1981) 22:1859-1862) were sequentially 
condensed to a nucleoside derivatized controlled pore glass support 
using a Biosearch SAM-1. The procedure included detrityl ation with 
trichloroacetic acid in dichlororaethane, condensation using 

25 benzotriazol e as activating proton donor, and capping with acetic 
anhydride and dimethyl ami nopyridine in tetrahydrofuran and pyridine. 
Cycle time was approximately 30 minutes. Yields at each step were 
essentially quantitative and were determined by collection and 
spectroscopic examination of the dimethoxytri tyl alcohol released 

30 during detrityl ation. 

2. Oligodeoxyribonucleotide Deprotection and ftirif ication 
Procedures; The solid support was removed from the column and exposed 
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to 1 ml concentrated ammonium hydroxide at room temperature for four 
hours in a closed tube. The support was then removed by filtration 
and the solution containing the partially protected 
oligodeoxynucleotide was brought to 55 °C for five hours. Ammonia was 

5 removed and the residue was applied to a preparative polyacryl amide 
gel. Electrophoresis was carried out at 30 volts/cm for 90 minutes 
after which the band containing the product was identified by UV 
shadowing of a fluorescent plate. The band was excised and eluted 
with 1 ml distilled water overnight at 4 C. This solution was applied 

10 to an Altech RP18 column and eluted with a 7-131 gradient of 
acetonitrile in IX ammonium acetate buffer at pH 6.0. The elution was 
monitored by UV absorbance at 260 nm and the appropriate fraction 
collected, quantitated by UV absorbance in a fixed volume and 
evaporated to dryness at room temperature in a vacuum centrifuge. 

15 3. Characterization of Oligodeoxyribonucleotides : Test 

aliquots of the purified oligonucleotides were labeled with 

polynucleotide kinase and gamma- 32 P-ATP. The labeled compounds were 
examined by autoradiography of 14-20X polyacryl amide gels after 
electrophoresis for 45 minutes at 50 volts/cm. This procedure 

20 verifies the molecular weight. Base composition was determined by 
digestion of the ol igodeoxyri bonucleotide to nucleosides by use of 
venom diesterase and bacterial alkaline phosphatase and subsequent 
separation and quantitation of the derived nucleosides using a reverse 
phase HPLC column ahd a 10* acetonitrile, IX ammonium acetate mobile 

25 phase. 

B. Isolation of Human Genomic ONA from Cell Line 

High molecular weight genomic DNA was isolated from a T cell 
line, Molt 4, homozygous for normal beta-globin available from the 
Human Genetic Mutant Cell Depository, Camden, NJ as GM2219C using 
30 essentially the method of Maniatis et al., supra . p. 280-281. 
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C. Purification of a Polymerase From Thermus aquaticus 

Ihennus aquaticus strain YT1, available without restriction 

from the American Type Culture Collection, 12301 Parklawn Orive, 

Rockville, MO, as ATCC No. 25,104 was grown in flasks in the following 
medium: 

Sodium Citrate l mM 

Potassium Phosphate, pH 7.9 5 mM 

Ammonium Chloride 10 mM 

Magnesium Sulfate 0.2 mM 

Calcium Chloride 0.1 mM 

Sodium Chloride l g/i 

Yeast Extract l g/i 

Tryptone 1 g/i 

Glucose 2 g/1 

Ferrous Sulfate 0.01 mM 

(The pH was adjusted to 8.0 prior to autoclaving. ) 

A 10-liter fermentor was inoculated from a seed flask 
cultured overnight in the above medium at 70 °C. A total of 600 ml 
from the seed flask was used to inoculate 10 liters of the same 
20 medium. The pH was controlled at 8.0 with ammonium hydroxide with the 
dissolved oxygen at 401, with the temperature at 70°C, and with the 
stirring rate at 400 rpm. 

After growth of the cells, they were purified using the 
protocol (with slight modification) of Kaledin et al., supra , through 
23 the first five stages and using a different protocol for the sixth 
stage. All six steps were conducted at 4°C. The rate of 
fractionation on columns was 0.5 columns/hour and the volumes of 
gradients during elution were 10 column volumes. An alternative and 
preferred purification protocol is provided in Example XIII below. 

30 Briefly, the above culture of the T^ aquaticus cells was 

harvested by centrifugation after nine hours of cultivation, in late 
log phase, at a cell density of 1.4 g dry weight/1. Twenty grams of 
cells were resuspended in 80 ml of a buffer consisting of 50 mM 
Tris-HCl pH 7.5, 0.1 mM EDTA. Cells were lysed and the lysate was 

35 ^centri fuged for two hours at 35,000 rpm in a Beckman TI 45 rotor at 4 
C. The supernatant was collected (fraction A) and the protein 
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fraction precipitating between 45 and 75X saturation of ammonium 
sulfate was collected, dissolved in a buffer consisting of 0.2 M 
potassium phosphate buffer, pH 6.5, 10 mM 2-mercaptoethanol , and 51 
glycerine, and finally dialyzed against the same buffer to yield 
fraction B. 

5 Fraction B was applied to a 2.2 x 30-cm column of DEAE- 

cellulose, equilibrated with the above described buffer. The column 
was then washed with the same buffer and the fractions containing 
protein (determined by absorbance at 280 nm) were collected. The 
combined protein fraction was dialyzed against a second buffer, 

10 containing 0.01 M potassium phosphate buffer, pH 7.5, 10 mM 2- 
mercaptoethanol , and 5X glycerine, to yield fraction C. 

Fraction C was applied to a 2.6 x 21- cm column of 
hydroxyapatite, equilibrated with a second buffer. The column was 
then washed and the enzyme was eluted with a linear gradient of 0.01- 
15 0.5 M potassium phosphate buffer, pH 7.5, containing 10 mM 2- 
mercaptoethanol and 5i glycerine. Fractions containing DNA poljmerase 
activity (90-180 mM potassium phosphate) were combined, concentrated 
four-fold using an Amicon stirred cell and YM10 membrane, and dialyzed 
against the second buffer to yield fraction D. 

20 Fraction D was applied to a 1.6 x 28-cm column of DEAE- 

cellulose, equilibrated with the second buffer. The column was washed 
and the polymerase was eluted with a linear gradient of 0.01-0.5 M 
potassium phosphate in the second buffer. The fractions were assayed 
for contaminating endonuclease (s) and exonuclease(s) by 

25 electrophoretically detecting the change in molecular weight of phage 
lambda DMA or supercoiled plasmid DNA after incubation with an excess 
of DNA polymerase (for endonuclease) and after treatment with a 
restriction enzyme that cleaves the DNA into several fragments (for 
exonuclease). Only those DNA polymerase fractions (65-95 mM potassium 

30 phosphate) having raininal nuclease contami nation were pooled. To the 
pool was aided autoclaved gelatin in an amount of 250 Ag/ml, and 
dialysis was conducted against the second buffer to yield Fraction E. 
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Fraction E was applied to a phosphocellulose column and 
eluted with a 100 ml gradient (0.01-0.4 H KC1 gradient in 20 mM 
potassium phosphate buffer pH 7.5). The fractions were assayed for 
contaminating endo/exonuclease (s) as described above as well as for 
polymerase activity (by the method of Kaledin et al.) and then 
pooled. The pooled fractions were dialyzed against the second buffer, 
then concentrated by dialysis against 501 glycerine and the second 
buffer. 

The molecular weight of the polymerase was determined by 
SOS-PAGE analysis. Marker proteins (Bio-Rad low molecular weight 
standards) were phosphoryl ase B (92,500), bovine serum albumin 
(66,200), ovalbumin (45,000), carbonic anhydrase (31,000), soybean 
trypsin inhibitor (21,500), and lysozyme (14,400). 

Preliminary data suggest that the polymerase has a molecular 
weight of about 86,000-95,000 daltons, not 62,000-63,000 daltons 
reported in the literature (e.g., by Kaledin et al.). 

The polymerase was incubated in 50 ^1 of a mixture 
containing either 25 mM Tris-HCl pH 6.4 or pH 8.0, and 0.1 M KC1, 10 
mM MgCl 2 , 1 mM 2-mercaptoethanol , 10 nmoles each of dGTP, dATP, and 
TTP, and 0.5 ^Ci ( 3 H) dCTP, 8/<g "activated* «1f thymus DNA, and 0.5- 
5 units of the polymerase. "Activated- DNA is a native preparation of 
DNA after partial hydrolysis with DNase I until 51 of the DNA was 
transferred to the acid-soluble fraction. The reaction was conducted 
at 70 C for 30 minutes, and stopped by adding 50^1 of a saturated 
aqueous solution of sodium pyrophosphate containing 0.125 M EDTA- 
Na 2 . Samples were processed and activity was determined as described 
by Kaledin et al., supra . 

The results showed that at pH 6.4 the polymerase was more 
than one-half as active as at pH 8.0. In. contrast, Kaledin et al. 
found that at pH about 7.0, the enzyme therein had 81 of the activity 
at pH 8.3. Therefore, the pH profile for tne thermostable enzyme 
herein is broader than that for the Kaledin «t al. enzyme. 

Finally, when only one or more nucleotide triphosphates were 
eliminated from a DNA polymerase assay reaction mixture, very little, 
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if any, activity was observed using the enzyme herein, and the 
activity was consistent with the expected value, and with an enzyme 
exhibiting high fidelity. In contrast, the activity observed using 
the Kaledin et al. ( supra) enz>me is not consistent with the expected 
5 value, and suggests roisincorporation of nucleotide triphosphate^ ). 

D. Amplification Reaction 

One microgram of the genomic DNA described above was diluted 
in an initial 100 1 aqueous reaction volume containing 25 mM Tris*HCl 
buffer (pH 8.0), 50 mM KC1, 10 mM MgC1 2 . 5 mM di thiothrei to! , 

10 200^g/ml gelatin, 1 M W of primer PC03, 1 ^,M of primer PC04, 1.5 mM 
dATP, 1.5 mM dCTP, 1.5 mM dGTP and 1.5 mM TTP. The sample was heated 
for 10 minutes at 98 r C to denature the genomic DNA, then cooled to 
room temperature. Four microliters of the polynerase ff-om Thermus 
aquaticus was added to the reaction mixture and overlaid with a 100x1 

15 mineral oil cap. The sample was then placed in the aluminum heating 
block of the liquid handling and heating instrument described above. 

The DNA sample underwent 20 cycles of amplification in the 
machine, repeating the following program cycle: 

1) heating from 37° C to 98* C in heating block over a period 
20 of 2.5 minutes; and 

2) cooling from 98' C to 37*C over a period of three minutes 
to allow the primers and DNA to anneal. 

After the* last cycle, the sample was incubated for an 
additional 10 minutes at 55* C to complete the final extension 
25 reaction. 

E. Synthesis and Phosphorylation of Oligodeoxyribonucleotide Probes 

A labeled DNA probe, designated RS24, of the following 
sequence was prepared: 

5 • -*CCCACAGGGCAGTMCGGCAGACTTCTCCTCAGGAGTCAG-3 ■ (RS24) 



WO 89/06691 



PCT/US89/00127 



47 



10 



15 



20 



where * indicates the label. This probe is 40 bases long, spans the 
fourth through seventeenth codons of the gene, and is complementary to 
the normal beta-globin allele (beta*). The schematic diagram of 
primers and probes is given below: 

S 110 bp v 

^ beta-globin 7 



PC03 RS24 PW 

This probe was synthesized according to the procedures 
described in Section I of Example I. The probe was labeled by 
contacting 20 pinole thereof with 4 units of T4 polynucleotide kinase 
(New England Biolabs) and about 40 pmole gamma" 32 P-ATP (New England 
Nuclear, about 7000 Ci/mmole) in a 40 yyj reaction volume containing 70 
mM Tris buffer (pH 7.6), 10 mM MgCl 2 , 1.5 mM spermine, and 10 mM 
dithiothreitol for 60 minutes at 37 "c. The total volume was then 
adjusted to 100 ^1 with 25 mM EDTA and the probe purified according to 
the procedure of Maniatis et al., Molecular Cloning (1982), 466-467 
over a 1 ml Bio Gel P-4 (BioRad) spin dialysis column equilibrated 
with Tris-EDTA ( TE) buffer (10 mM Tris buffer, 0.1 mM EDTA, pH 8.0). 
TCA precipitation of the reaction product indicated that for RS24 the 
specific activity was 4.3 ^Ci/pmole and the final concentration was 
0.118 pmole/^1. 

F. Dot Blot Hybridizations 

Four microliters of the amplified sample from Section IV and 
5.6^1 of appropriate dilutions of beta-globin plasmid DNA calculated 
to represent amplification efficiencies of 70, 75, 80, 85, 90, 95 and 

25 100X were diluted with 200 p} 0.4 N NaOH, 25 mM EDTA and spotted onto 
a Genatran 45 (Plasco) nylon filter by first wetting the filter with 
water, placing it in a Bio-Oot (Bio-Rad, Richmond, CA) apparatus for 
preparing dot blots which holds the filters in place, applying the 
samples, and rinsing each well with 0.1 ml of 20 x SSPE (3.6 M NaCl, 

30 200 mM NaH 2 P0 4 . 20 mM EDTA), as disclosed by Reed and Mann, Nucleic 
Acids Research. 13, 7202-7221 (1985). The filters were then removed, 
rinsed in 20 x SSPE, and baked for 30 minutes at 80 °C in a vacuum 
oven. 
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After baking, each filter was then contacted with 16 ml of a 
hybridization solution consisting of 3 x SSPE, 5 x Denhardt's solution 
(1 x = 0.02X polyvinylpyrrolidone, 0.02X Ficoll, 0.021 bovine serum 
albumin, 0.2 mM Tris, 0.2 mM EDTA, pH 8.0), 0.51 SDS and 30% 
5 formamide, and incubated for two hours at 42°C. Then 2 pmole of probe 
RS24 was added to the hybridization solution and the filter was 
incubated for two minutes at 42* C. 

Finally, each hybridized filter was washed twice with 100 ml 
of 2 x SSPE and 0.11 SDS for 10 minutes at room temperature. Then the 
10 filters were treated once with 100 ml of 2 x SSPE, 0.1X SDS at 60° C 
for 10 minutes. 

Each filter was then autoradiographed , with the signal 
readily apparent after two hours. 

G. Discussion of Autoradiogram 

15 The autoradiogram of the dot blots was analyzed after two 

hours and compared in intensity to standard serial dilution beta- 
globin reconstructions prepared with Hae III/Mae I-di gested pBR:beta A , 
where beta A is the wild-type allele, as described in Saiki et al., 
Science , supra . Analysis of the reaction product indicated that the 

20 overall amplification efficiency was about 95S, corresponding to a 
630,000-fold increase in the beta-globin target sequence. 

EXAMPLE II 

A. Ampl if ication Reaction 

Two 1 /<g samples of genomic DNA extracted from the Molt 4 
25 cell line as described in Example I were each diluted in a 100 /J 
reaction volume containing 50 mM KCl, 25 mM Tris-HCl buffer pH 8.0, 10 
mM MgCl 2 , 1 pi\ of primer PC03, 1 /tM of primer PC04, 200 jug/m\ gelatin, 
10X dimethyl sulfoxide (by volume), and 1.5 mM each of dATP, dCTP, dGTP 
and TTP. After this mixture was heated for 10 minutes at 98* C to 
30 denature the genomic DMA, the samples were cooled to room temperature 
and 4 ju! of the polymerase from Thennus aquaticus described in Example 
1 was added to each sample. The samples were overlaid with mineral 
oil to prevent condensation and evaporative loss. 
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One of the samples was placed in the heating block of the 
machine described in Example I and subjected to 25 cycles of 
amplification, repeating the following program cycle: 

(1) heating from 37 to 93*C over a period of 2.5 minutes; 

5 (2) cooling from 93° C to 37 "c over a period of three 

minutes to allow the primers and DNA to anneal; and 

(3) maintaining at 37*C for two minutes. 

After the last cycle the sample was incubated for an 
additional 10 minutes at 60° C to complete the final extension 
10 reaction. 

The second sample was placed in the heat-conducting 
container of the machine, described in more detail in EP 236,069, 
supra_. The heat-conducting container is attached to Peltier heat 
pumps which adjust the temperature upwards or downwards and a 
15 microprocessor controller to control automatically the amplification 
sequence, the temperature levels, the temperature "ramping and the 
timing of the temperature. 

The second sample was subjected to 25 cycles of 
amplification, repeating the following program cycle: 

20 heating from 37 to 95°C over a period of three minutes; 

(2) maintaining at 95 c C for 0.5 minutes to allow 
denaturation to occur; 

(3) cooling from 95 to 37° C over a period of one minute; 

and 

25 (4) maintaining at 37 "c for one minute. 

B. Analysis 

Two tests were done for analysis, a dot blot and an agarose 
gel analysis. 

For the dot blot analysis, a labeled DNA probe, designated 
30 RS18, of the following sequence was prepared. 
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5 ' -*(.TCC7GAGGAGAAGTCTGC-3 ' I RSI 3) . 

where * indicates the label. This probe is 13 bases long, spans the 
fourth through seventeenth cocons of the gene, and is csmol ementary to 

A 

the normal oeta-globin allele (.bets^}. Tfie schematic diagram of 
primers and probes is given below: 



s 110 bp v 

^ beta-globi n 7 



PC03 RSI 8 PC04 

This probe was synthesized according to the procedures 
10 described in Section I of Example I. The probe was labeled by 
contacting 10 pmole thereof with 4 units of T4 polynucleotide kinase 
(New England Biolabs) and about 40 pmole gamma^P-ATP (New England 
Nuclear, about 7000 Ci/mmole) in a 40^1 reaction volume containing 70 
mM Tris-HCl buffer (pH 7.6), 10 mM MgCl 2 . 1-5 mM spermine and 10 mM 
15 dithiothrei to! for 60 minutes at 37°C. The total volume was then 
adjusted to 100 ^u.1 with 25 mM EDTA and purified according to the 
procedure of Maniatis et al;, supra , p. 466-467 over a 1 ml Bio Gel P- 
4 (BioRad) spin dialysis column equilibrated with Tris-EDTA (TE) 
buffer (10 mM Tris-HCl buffer, 0.1 mM EDTA, pH 8.0). TCA 
20 precipitation of the reaction product indicated that for RS18 the 
specific activity was 4.6 yuCi/pmole and the final concentration was 
0.114 pmole/^l. 

Five microliters of the amplified sample from Section I and 
of a sample amplified as described above except using the Klenow 

25 fragnent of col i DMA Polymerase I instead of the thermostable 
enzyme were diluted with 195 //I 0.4 N NaOH, 25 mM EDTA and spotted 
onto two replicate Genatran 45 (Plasco) nylon filters by first wetting 
the filters with water, placing them in a Bio-Dot (Bio-Rad, Richmond, 
CA) apparatus for preparing dot blots which holds the filters in 

30 place, applying the samples, and rinsing each well with 0.4 ml of 20 x 
SSPE (3.6 M NaCl, 200 mM NaH 2 P0 4 , 20 mM EDTA), as disclosed by Reed 
and Mann, supra . The filters were then removed, rinsed in 20 x SSPE, 
and baked for 30 minutes at 80°C in a vacuum oven. 
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After baking, each filter was then contacted with 5 nil of a 
hybridization solution consisting of 5 x SSPE, 5 x Oenhardt's solution 
(1 x = 0.02? polyvinylpyrrolidone, 0.025 Ficoll, 0.025 bovine serum 
albumin, 0.2 mM Tris, 0.2 mM ED7A, pH 8.0) and 0.55 SDS, and incited 
for 60 minutes at 55 C. Then 5Vi of probe RSlo was added to the 
hybridization solution and the filter was incubated for £0 minutes at 
55 C. 

Finally, each hybridized filter was washed twice with 100 ml 
of 2 x SSPE and 0.15 SDS for 10 minutes at room temperature. Tnen the 
filters were treated twice more with 100 ml of 5 x SSPE, 0.15 SDS at 
50 C for 1) one minute and 2) three minutes, respectively. 

Each filter was then autoradiographed, with the signal 
readily apparent after 90 minutes. 

In the agarose gel analysis, 5^1 each amplification 
reaction was loaded onto 45 NuSieve/0.55 agarose gel in 1 x TBE buffer 
(0.089 M Tris, 0.089 M boric acid, and 2 mM EDTA) and el ectrophoresed 
for 60 minutes at 100V. After staining with ethidium bromide, DNA was 
visualized by UV fluorescence. 

The results show that the machines used in Example I and 
this example were equally effective in amplifying the DNA, showing 
discrete -hi gh- intensity 110-base pair bands of similar intensity, 
corresponding to the desired sequence, as well as a few other discrete 
bands of much lower- intensity. In contrast, the amplification method 
which involves reagent transfer after each cycle using the Klenow 
fragment of E. coli Polymerase I, gave a DNA smear resulting from the 
non-specific amplification of many unrelated DNA sequences. 

It is expected that similar improvements in amplification 
and detection would be achieved in evaluating HLA-DQ, DR and DP 
regions. 

If in the above experiments the amplification reaction 
buffer contains 2 mM MgCl 2 instead of 10 mM MgCl? and 150-200 >,M of 
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each nucleotide rather than 1.5 mM of each, and if the lower 
temperature of 37*C is raised to 45-58° C during amplification, better 
specificity and efficiency of amplification occurs. Also, QMSO *as 
found not necessary or preferred for amplification. 

5 EXAMPLE III 

ftnpl i fication and Cloning 

For amplification of a 119-base pair fragment on the human 
beta-globin gene, a total of 1 microgram each of human genomic DNA 
isolated from the Holt 4 cell line or from the GM2064 cell line 

10 (representing a homozygous deletion of the beta- and delta-hemoglobin 
region and available from the Human Genetic Mutant Cell Depository. 
Camden, NJ) as described above was amplified in a 100^1 reaction 
volume containing 50 mM KC1 , 25 mM Tris'HCl pH 8, 10 mM MgCl 2 » 
200 /JLg/ml gelatin, 5 mM 2-mercaptoethanol , 1.5 mM each of dATP, dCTP, 

15 TTP, and dGTP, and l^tM of each of the following primers: 

5 '-CTTCTGcagCAACTGTGTTCACTAGC-3 ' (GH18) 
S'-CACaAgCTTCATCCACGTTCACC-S 1 (GH19) 

where lower case letters denote mismatches from wild-type sequence to 
create restriction enzyme sites. GH18 is a 26-base oligonucleotide 

20 complementary to the negative strand and contains an internal PstI 
site. GH19 is a 23-base oligonucleotide complementary to the plus 
strand and contains .an internal Hind lll recognition sequence. These 
primers were selected- by first screening the regions of the gene for 
homology to the PstI and Hind lll restriction sites. The primers were 

25 then prepared as described in Example I. 

The above reaction mixtures were heated for 10 minutes at 95 
^ and then cooled to room temperature. A total of 4 ju\ of the 
polymerase described in Example I was added to each reaction mixture, 
and then each mixture was overlayed with mineral oil. The reaction 
30 mixtures were subjected to 30 cycles of amplification with the 
following program: 
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2.5 rain, ramp, 37 to 98 C 
3 min. ramp, 98 to 37 C 
2 rain, soak, 37 C 



After the last cycle, the reaction mixtures were incubated 
for 20 minutes at 65 C to complete the finai extension. The mineral 
^oil was extracted *ith chloroform and the mixtures were stored at -20 



A total of 10 ^ul of the amplified product was digested with 
0.5 jig MianplO cloning vector, which is publicly available from 
Boehringer-Mannheim. in a 50 volume containing 50 mM NaCl, 10 mM 
Tris-HCl, pH 7.8, 10 mM MgCl 2 , 20 units PstI and 26 units Hind I II for 
90 minutes at 37 C C. The reaction was stopped by freezing at -20 °C. 
The volume was adjusted to llOyul with TE buffer and loaded (100 /J ) 
onto a 1 ml BioGel P-4 spin dialysis column. One 0.1 ml fraction was 
15 collected and ethanol precipitated. 

(At this point it was discovered that there was • beta-globin 
amplification product in the. GM2064 sample. Subsequent experiments 
traced the source of contamination to the primers, either GH18 or 
SH19. Because no other* source of primers was available, the 
20 experiment was continued with the understanding that some cloned 
sequences would be derived from the contaminating DNA in the primers.) 

The ethanol pellet was resuspended in 15^1 water, then 
adjusted to 20^1 volume containing 50 mM Tris-HCl, pH 7.8, 10 mM 
MgCl 2 . 0.5 mM ATP, 10 mM dithiothreitol , and 400 units ligase. This 
25 mixture was incubated for three hours at 16°C. 

Ten microliters of ligation reaction mixture containing Molt 
4 DNA was transformed into E. coli strain JM103 competent cells, which 
are publicly available from BRL in Bethesda, MD. The procedure 
followed for preparing the transformed strain is described in Messing, 
J - (19B1) Third Cle veland Symposium on Macromol ecu! es : Recombinant DNA . 
ed. A. Walton, Elsevier, Amsterdam, 143-163. A total of 651 colorless 
plaques (and 0 blue plaques) were obtained. Of these, 119 had a (+)- 
strand insert (18X) and 19 had a (-)- strand insert (31). This is an 
increase of almost 20-fold over the percentage of beta-globin positive 



plaques among the primer-positive plaques from the amplification 
technique using Klenow fragment of col i Polymerase i, where the 
reaction proceeded for two minutes at 25°C, after which the steps of 
heating to 100°Cfor two minutes, cooling, adding Xlenow fragnent, tr.c 
reacting were repeated nine times. These results confirm the improved 
specificity of the amplification reaction employing the thermostable 
enzyme herein. 

In a later cloning experiment with GM2064 and the 
contaminated primers, 43 out of 510 colorless plagues (81) had the 
(+)- strand insert. This suggests that approximately one-half of the 
119 clones from Molt 4 contain the contaminant sequence. 

Ten of the (+)- strand clones from Molt 4 were sequenced. 
Five were normal wild-type sequence and five had a single C to T 
mutation in the third position of the second codon of the gene (CAC to 
CAT). Four of the contaminant clones from GM2064 were sequenced and 
all four were normal. 

Restriction site-modified' primers may also be used to 
amplify and clone and partially sequence the human N-ras oncogene and 
to clone base pair segments of the HLA DQ-alpha, DQ-beta and DR-beta 
genes using the above technique. 

Again, if the concentrations of MgCl 2 and nucleotides are 

reduced to 2 mM and 150-200 uM, respectively, and the minimum cycling 

c ° 

temperature is increased from 37 C to 45-58 C, the specificity and 
efficiency of the amplification reaction can be increased. 

EXAMPLE IV 

Gene Retrieval 

A. IDENTIFICATION OF A DNA SEQUENCE PROBE FOR THE TAQ POLYMERASE GENE 

A specific ONA sequence probe for the Taq pol gene was 
obtained following imnunological screening of a lambdagtll expression 
library. aquaticus DNA «as digested to completion with Alul, 

li gated with Eco R I 12-mer linkers (CCGGAATTCCGG, New England Biolabs), 
digested with EcoR I and ligated with dephosphorylated, EcoRI-di gested 
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lambdagtll DNA { Promega Biotech). The li gated DNA was packaged 
(Gigapack Plus, Stratagene) and transfected into t. col i K- 12 strain 
Y1090 (provided by R. Young). 

The initial library of 2 x 10° placues was screened (Young, 
5 R.A., and R.W. Davis (1333) Science , 222^:773-782) with a 1:2000 
dilution of a rabbit polyclonal antiserum raised to purified Taq 
polymerase (see Examples I and XIII). Candidate plaques were replated 
at limiting dilution and rescreened until homogeneous (^3 cycles). 
Phage were purified from candidate plaques which failed to react with 
10 preimmune serum and reacted with immune serum. 

Candidate phage were used to lysogenize £. col i K- 12 strain 
Y1089 (R. Young). Lysogens were screened for the production of an 
IPTG inducible fusion protein (larger than beta-galactosidase) which 
reacted with the Taq pol>merase antiserum. Solid phase, size- 
15 fractionated fusion proteins were used to affinity purify epi tope- 
specific antibodies from the total polyclonal antiserum (Goldstein, 
L.S.B., et al. (1986) J. Cell Biol. 102 :2076-2087). 

The "fished" , epi tope- selected antibodies were used, in 
turn, in a Western analysis to identify which lambdagtll phage 

20 candidates encoded DNA sequences uniquely specific to Taq 
polymerase. One lambdagtll phage candidate, designated lambdagt:!, 
specifically selected antibodies from the total rabbit polyclonal Taq 
polymerase antiserum, which uniquely reacted with both purified Taq 
polymerase and crude* extract fractions containing Taq polymerase. 

25 This phage, lambdagt :1, was used for further study. 

The w 115 bp EcoRI -adapted Alu l fragment of Thermus aquaticus 
DNA was labeled (Maniatis et al,, supra) to generate a Taq pol>merase- 
specific probe. The probe was used in Southern analyses and to screen 
a T. aquaticus DNA random genomic library. 

30 B. CONSTRUCTION AND SCREENING OF A THERMUS AQUATICUS RANDOM GENOMIC 
LIBRARY- 

Lambda phage Charon 35 (Uilhelraine, A. M..et al., supra) was 
annealed and li gated via its cohesive ends, digested to completion 
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with BamHI, and the annealed arms were purified from the "stuffer" 
fragments by potassium acetate density gradient ultracentri fucation 
(Maniatis, et al.. supra) . T. aouaticus ONA was partially digested 
with Sau3 A and the 15-20 kb size fraction purified by sucrose density 

5 gradient ul tracentri fugation. The random genomic library was 
constructed by ligating the target and vector DNA fragments at -a 1:1 
molar ratio. The ONA was packaged and transfected into JE. coli K-12 
strains LE392 or K802. A library of ^20,000 initial phage 
containing ;>99I recombinants was amplified on Z. coli K-12 strain 

10 LE392. 

The CH35 Taq genomic phage library was screened (Maniatis et 
al., supra) with the radiolabeled EcoR I insert of gtll:l. 
Specifically hybridizing candidate phage plaques were purified and 
further analyzed. One phage, designated Ch35::4-2, released > four X- 
15 aquaticus DNA fragments upon digestion with Hindi II (^8.0, 4.5, 0.8, 
0.58 kb) 

The four Hindi II T^ aquati cus DNA fragments were li gated 
with Hindlll digested plasmid BSM13 + (3.2 kb, Vector Cloning Systems. 
San Diego) and individually cloned following transformation of E_. coli 
20 K-12 strain DG98. 

The 8.0 kb Hindi II DNA fragment from CH35::4-2 was isolated 
in plasmid pFC82 (11.2 kb), while the 4.5 kb Hindlll DNA fragment from 
CH35::4-2 was isolated in plasmid pFC83 (7.7 kb). 

Z. coli strain DG98 harboring pFC82 was shown to contain a 
25 thermostable, high temperature DNA polymerase activity (Table 1). In 
addition, these cells synthesize a new ^60 kd molecular weight 
polypeptide which is immunologically related to Taq DNA polymerase. 

The Taq polymerase coding region of the 8.0 kb Hindlll DNA 
fragment was further localized to the lac -promoter proximal 2.68 kb 
30 Hindlll to Asp718 portion of the 8.0 kb Hindlll fragment. This region 
was subcloned to yield plasmid pFC85 (6.0 kb). Upon induction with 
IPT6, E. coli DG98 cells harboring plasmid pFC85 synthesize up to 100- 
fold more thermostable, Taq polymerase- related activity (Table 1) than 
the original parent clone (pFC82/DG98). While cells harboring pFC85 
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synthesize a significant amount of a thermostable ONA polymerase 
activity, only a portion of the Taq jjo1_ ONA sequence is translated, 
resulting in the accumulation of a ^60 kc Taq polymerase- rsl ated 
polypeptide. 

TABLE 1 

Expression of a Thermostable DMA Polymerase Activity in E^ coli # 

Sample Units */ml 

IPT6 +IPTG 

BSM13/DG98 . 0 .02 

10 pFC82/DG98 2.2 2.7 

PFC85/DG98 H.g 6 43.8 



25 



Cells were grown to late log phase (+/- IPTG, 10 nrt). harvested, 
sonicated, heated at 75 C for 20 minutes, centrifuged and the 
clarified supernatant assayed at 70 C for DNA polymerase activity. 

* 1 unit = 1 nMole dCTP incorporated in 30 minutes. 

EXAMPLE V 

Expression of Taq Polymerase 

The thermostable gene of the present invention can be 
expressed in any Of a variety of bacterial expression vectors 
including 0G141 (ATCC 39588) and pP L N RBS ATG, vectors disclosed in U.S. 
Patent No. 4,711,845, the disclosure of which is incorporated herein 
by reference. Both of these host vectors are pBR322 derivatives that 
have either a sequence containing a tryptophan promoter-operator and 
ribosome binding site with an operably linked ATG start codon (0G141) 
or a sequence containing the lambda P L promoter and gene N ribosome 
binding site operably linked to an ATG start codon (pP L % s ATG). 
Either one of these host vectors may be restricted with SacI, and 
blunt ended with Klenow or SI nuclease to construct a convenient 
restriction site for subsequent insertion of the Taq polymerase gene. 
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The full-length Taq polymerase gene was constructed from the 
ONA insert fragments subcloned into plasmids pFC83 and pFC35 as 
follows- Vector BSM13 + (commercially available from Vector Cloning 
Systems, San Diego, CA) was digested at the unique Hind I II site, 

5 repaired with Klenow and dNTPs, and ligated with 74 DNA ligase to a 
Bgl I I octanucleotide linxer, 5 1 -CAGATCTG-3 ' (New England Biolabs), and 
transformed into col i strain 0G98. Plasmids were isolated from 
Anp R lac Zal pha* transformants. One of the clones was digested with 
Bgl II and Asp718 restriction enzymes, and the large vector fragment 

10 purified by gel electrophoresis. 

Next, plasmid pFC83 was digested with Bgl II and Hindlll and 
the "730 base pair fragment was isolated. Plasmid pFC85 was digested 
with Hindi II and Asp718 and the *>2.68 kb fragment isolated and joined 
in a three-piece ligation to the' v 730 base pair Bgl I I -Hind III fragment 

15 from pFC83 and the Bgl II-Asp718 vector fragment of BSM13 + . This 
ligation mixture was used to transform coli strain DG98 (ATCC 
39,768 deposited July 13, 1984) from which Amp R colonies were selected 
and an^S.SS kilobase plasmid (pLSGl) was isolated. Isopropyl-beta-D= 
thiogalactoside ( IPTG)- induced DG98 cells harboring pLSGl synthesized 

20 Taq ONA polymerase indistinguishable in size from the native enzyme 
isolated from T. aquaticus . 

01 igonucleoti de-directed mutagenesis (see Zoller and Smith, 
Nuc. Acids Res. (1982) 10^:6487-6500) was used to simultaneously 1) 
introduce an Sph I site within codons 3 to 5 of the Taq DNA polymerase 
25 gene sequence (see Figure 1, nt 8-13), 2) increase the A/T content of 
four of the first seven codons without effecting a change in the 
encoded amino acids (within codons 2-7 in Figure 1), 3) delete 170 
nucleotides of the lacZ DNA and Xt aquaticus ONA 5' 'to the DNA 
polymerase gene initiation codon. 

30 Bacteriophage R408 (Russel, M., et al., Gene , (1986) 45_:333- 

338) was used to infect pLSGl/0698 cells and direct the synthesis of 
the single-stranded DNA (ss) form (plus strand) of pLSGl. Purified 
pLSGl ssONA was annealed with purified PvuII-digested BSM13 + Bgl I I 
vector fragments and the 47-mer mutagenic oligonucleotide DG26 (5'- 
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C CCTTGGGC 7CA AAAAG7GG AAGC ATGC CT CT C ATAGC TGTTT CCTS ) . Following extension 
with coli ONA polymerase ! Klenow fragment, transformation cf DG38 
cells, and selection of Arap R trans formants , the colonies were screened 
with 5* " P-labeled 0G26. Hybridizing candidates were screened for 
5 loss of the BoHI restriction site, deletion of approximately 170 base 
pairs of lac_2:T^ aquaticus DNA, and introduction of a unique 5oh_I 
site. One candidate, designated pLSG2, was sequenced and shown to 
encode the desired sequence. 

pLSGl sequence: 

10 14^ 47bp Boll! lOSbp 

CAGGTOCAGCT ATG ACC ATG MA7CT ..... 

...AAC ATG AGG GGG ATG CTG CCC CTC TTT 
PLSG2 sequence: 

S-D. SphI 
!5 CAGGAAACAGCTATG AGA GGC ATG CTT CCA CTT TTT 

Oligonucleotide-directed mutagenesis was used to introduce a 
unique BgUI site in plasmid pLSG2 immediately following the TGA stop 
codon for the Taq polymerase gene (following nucleotide 2499 in Figure 
1). As above, bacteriophage R408 was used to generate the single- 

20 stranded (plus) form of plasmid pLSG2. Purified pLSG2 ssONA was 
annealed with purified _Pvu_I I -digested BSM13 + BgUI vector fragment and 
the 29-mer mutagenic oligonucleotide SC107 (5'- 
GCATGGGGTGGTAGATCTCACTCCTTGGC). Following extension with Klenow 
fragment (50 mM each dNTP), transformation of DG98 cells and selection 

25 for Amp R transfonnants , colonies were screened with 5' 32 P-labeled 
SC107. Hybridizing candidates were screened for acquisition of a 
unique Bgl_II site. One candidate, designated pSYC1578, was sequenced 
and shown to contain the desired sequence. 
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pLSG2 sequence: 

... GCC AAG GAG TGA TAC CAC CCC A7G C 
p 5YC1573 sequence: 

33HI 



... GCC AAG GAG TGA GATC TAC CAC CCC ATG 



EXAMPLE VI 

Construction of expression vectors DDG160 and 0DGI6I 

The Anp R or Tet R lambdaPL promoter, gene N ribosome binding 
site, polyl inker, BT cry PRE ( BT) (positive retroregul atory element, 

1C described in U.S. Patent No. 4,666,848, issued Hay 19, 1987), in a 
Col El cop ts vector were constructed from previously described plasmids 
and the duplex synthetic oligonucleotide linkers DG31 and DG32. The 
DG31/32 duplex linker encodes a 5* Hind i II cohesive end followed by 
SacI , Ncol, Kpnl / Asp7 18, Xmal/Smal recognition sites and a 3' BamH I 

15 cohesive end. 

Asp718 
Sac I Nco I Xmal 
DG31 5' AGCTTATGAGCTCCATGGTACCCCGGG 

ATACTCGAGGTACCATGGGGCCCCTAG- 5 ' DG32 



20 A. Construction of ftnp* plasmid pDG160 

Plasmid pFC54.t, a 5.96 kb plasmid described in U.S. Patent 
4,666,848, supra, was digested with Hind lll and BamH I and the isolated 
vector fragnent was li gated with a 5-fold molar excess of 
nonphosphorylated and annealed DG31/32 duplex. Following ligation, 

25 the DNA was digested with Xba l (to inactivate the parent vector IL-2 
DNA fragnent) and used to transform E^ coli K12 strain DG116 to 
ampicillin resistance. Colonies were screened for loss of the des- 
ala-ser^ 5 IL-2 mutein sequence and acquisition of the 0G31/32 
polyl inker sequence by restriction enzyme digestion. The polyl inker 

30 region in one candidate, designated pDG160, was sequenced and shown to 
encode the desired polyl inker DNA sequence. 



10 



15 



61 

B. Construction of Tet* plasmid a0G161 

Plasmid pAW740CH3 (ATCC 67605), the sourca of a modified 
tetracycline resistance gene wherein the 3amH I and Hindlll restriction 
sites were eliminated, and which contains the lambda?; promoter, ger.e 
N ribosorae binding site, crv PRE in a Col El cop ts vector, was digested 
to completion with Hindlll and BamH I and the 4.19 kb vector fragment 
purified by agarose gel electrophoresis. The purified vector DNA 
fragment was ligated with a 5-fold molar excess of nonphosphoryl ated 
annealed 0G31/32 duplex. E^cpJl K12 strain DG116 was transformed 
with a portion of the DNA, and Tet R colonies screened for presence of 
4.2 kb plasmids. Several candidates were further screened by 
restriction enzyme digestion and the polyl inker region sequenced by 
the Sanger method. One of the candidates with the desired sequence 
was designated pDG161. 

EXAMPLE VII 

A. Construction of an Amp R P L promoter, gene N ribosome binding site, 
(N RBS ) Taq polymerase (832) BT cry PRE, cop ts expression vector 

To express the full-length (832 amino acid) mutated Taq 
polymerase sequence encoded by plasmid pSYC1578 under the control of 
20 the lambda P L promoter and gene N ribosome binding site, plasmids 
PSYC1578 and pFC54.t were used. Plasmid pSYC1578 was digested with 
Sp_h_I and BgJ_II and -the resulting approximate 2.5 kb Taq polymerase 
gene fragment purified by agarose gel electrophoresis and 
electroelution. Plasmid pFC54.t was digested to completion with 
25 Hindlll and BamHI and the vector fragment purified by agarose gel 
electrophoresis. The synthetic oligonucleotides DG27 (5'- 

AGCTTATGAGAGGCATG ) and DG28 (5'-CCTCTCATA) were synthesized and 
annealed. Purified pFC54.t fragment (0.085 pinoles), purified Taq 
polymerase gene fragment (0.25 pmoles) and annealed nonphosphoryl ated 
DG27/28 duplex adaptor (0.43 pinoles) were combined in 30^1 and 
ligated at -14 C. A portion of the ligated DMA was heated to 75*C (15 
minutes) to inactivate the DNA ligase in the samples and treated with 
Xbal to linearize (inactivate) any IL-2 rautein containing ligation 
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products. The ligated and digested DNA (approximately 100 ng) was 
used to transform col i K12 strain DG11S to ampicillin resistance. 
Amp colonies were screened for the presence of an approximate 3 kb 
plasmid which yielded the expected digestion products with Hind i! 

5 (621 bp + 7,410 bp), EcoftI ( 3,250 Dp + 4,781 bp) and Soh_I (3,031 bp}. 
Asp 71 3 (3,021 bp), BamHI (3,031 bp) and *u!I (4,090 bp * 3,477 b? + 
464 bp). Several candidates were subjected to DNA sequence analysis 
at the 5' larabdaP L :TaqPol junction and the 3' TaqPol :BT junction. One 
of the candidates was also screened with an anti-Taq polymerase 

10 antibody for the synthesis of an approximate 90 kd inmunoreacti ve 
antigen. Single colonies were transferred from a 30°C culture plate 
to a 41°C culture plate for two hours. The colonies were scraped with 
a toothpick from both the 30°C and 41°C plates, boiled in SOS loading 
buffer, subjected to SOS-PAGE electrophoresis and the separated 

15 proteins transferred to a nitrocellulose membrane. The membranes were 
probed with a 1 :6,000 dilution of a polyclonal anti-Taq antibody and 
developed with a . goat anti-rabbit HRP conjugate. All of the 
candidates tested showed evidence of temperature inducible approximate 
90 kd Taq polymerase- rel ated protein. One of the several plasmid 

20 candidates which directed the synthesis of Taq polymerase in E. coli 
and contained the expected DNA sequence was designated pLSGB.- 

B. Construction of a Tet R P L promoter, gene N ribosome binding site, 
Taq pol>tnerase (832) BT cry PRE cop ts expression vector 

25 To express the full length (832 amino acid) mutated Taq 

polymerase sequence encoded by plasmid pSYC1578 under control of the 
lambda P L promoter and gene N ribosome binding site in a Tet* vector, 
we used plasmids pSYC1578 and pAW740CHB. Plasmid pSYC1578 was 
digested with SphI and Bql I I and the resulting approximate 2.5 kb Taq 

30 polymerase gene fragment was purified by agarose gel electrophoresis 
and electroelution. Plasmid pAW740CHB was digested to completion with 
Hindi 1 1 and BamH I and the resulting 4.19 kb vector fragment purified 
by agarose" gel electrophoresis and electroelution. The synthetic 
oligonucleotides DG27 and DG28 (described previously) were annealed. 

35 Purified pAH740CHB vector fragment (0.12 pmoles) was ligated with 



purified Taq polymerase gene fragnent [0.24 pmoles) and annealed 
nonphosphorylated 0G27 ,'28 duplex adaptor (0.24 pmoles) in 30^1 at 14 
C A portion of the ligatad ONA (100 ng) was used to transfer* ^ 
cpJl Ki2 strain DG11S to tetracycline resistance. 7et a candidates 
were screened for the presence of an approximate 6.7 kb plasraid which 
yielded the expected digestion products with Hind II I (621 bp + 5,074 
bp), EcoRI (3,445 bp + 3.250 bp), Asp_718 (6.695 bp). Sohl (3,445 bp * 
3,250 bp), BamHI (6,695 bp) and PVulI (3,477 bp + 2,754 bp + 464 
bp). Several candidates were subjected to DNA sequence analysis at 
the 5* larabdaP L :TaqPol junction and the 3' Taq Pol :BT junction. 
Candidates were also screened by single colony imniunoblot as described 
above for the temperature inducible synthesis of Taq polymerase. One 
of the plasmid candidates which directed the synthesis of Taq 
polymerase in E^ coli. and contained the expected DNA sequence was 
designated pLSG6. 

EXAMPLE VIII 

Construction of a Het4 829 amino acid form of Tag polymerase 

The predicted fourth codon of native Taq polymerase directs 
the incorporation of a methionine residue (see pLSGl and pLSG2 5' 
sequences above). To obtain a further mutated form of the Taq 
polymerase gene that would direct the synthesis of an 829 amino acid 
primary translation, product, we used plasmids pSYC1578 and pOG161 
Plasraid pSYC1578 was. digested with SfihJ, treated with L. coU_ DNA 
polymerase I Klenow fragnent in the presence of dGTP to remove the 
four-base 3' cohesive end and generate a CTT (leucine, 5th codon) 
blunt end. Following inactivation of the ONA polymerase and 
concentration of the sample, the ONA was digested with BaJIl and the 
approximate 2.5 kb Taq polymerase gene fragnent purified by agarose 
gel electrophoresis and electroelution. Plasmid pOG161 was digested 
to completion with SacJ, repaired with E^ coH DNA polymerase I Klenow 
fragnent io. the presence of dGTP to remove the four base 3' cohesive 
end and generate an ATG terminated duplex blunt end. Following 
inactivation of the polymerase, the sample was digested with BamHI. 
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Digested pOG161 (0.146 pmole) and purified Taq polymerase 
fragnent (0.295 pmole) were ligated at 30^9/ml under sticky end 
conditions overnight. The partially ligated DNA sample (SamHI/Sql II 
ends) was diluted tc 15^g/ml and ligated for five hours under blunt 

5 end conditions. The DNA ligase was inactivated (75*C» 10 minutes) and 
the sample digested with Ncol to linearize any ligation products 
containing the pDG161 polyl inker sequence. Sixty nanograms of the 
ligated and digested DNA was used to transform coli K12 strain 
0G116 to tetracycline resistance. Tet R candidates were screened for 

10 the presence of an approximate 6.7 kb plasraid which yielded the 
expected digestion products when treated with Hindi II (612 bp + 6,074 
bp), EcoR I (3,445 bp + 3,241 bp) and Sph I (6,686 bp). Colonies were 
screened as above by single colony immunoblot for the temperature 
inducible synthesis of an approximate 90 kd Taq polymerase-related 

15 polypeptide. One of the plasmids, designated pLSG7, that directed the 
synthesis of a Taq polymerase-related polypeptide was subjected to 
Sanger sequence determination at the 5 1 lambda P L promoter :Taq 
polymerase junction and the 3' Taq polymerase :BT junction. Analysis 
of the DMA sequence at the 5' junction confirmed the restriction 

20 enzyme analysis (loss of one of the SphI sites and a 612 bp Hind I II 
fragment, slightly smaller than the 621 bp Hindi II fragment in pLSG6) 
and indicated the derivation of a plasmid encoding an 829 amino acid 
form of Taq pol>merase. 

• 

EXAMPLE IX 

25 Construction of Met289 fr289) 544 amino acid form of Taq polymerase 

During purification of native Taq polynerase (Example XIII) 
we obtained an altered form of Taq polymerase that catalyzed the 
template dependent incorporation of dNTP at 70^C. This altered form 
of Taq polymerase was immunologically related to the approximate 90 kd 
30. form described in Example XIII but was of lower molecular weight. 
Based on mobility, relative to BSA and ovalbumin following SDS-PAGE 
electrophoresis, the apparent molecular weight of this form is 
approximately 61 kd. This altered form of the enzyme is not present 
in carefully prepared crude extracts of Thermus aquaticus cells as 
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determined by SDS-PAGE Western blot analysis or in situ DNA polymerase 
activity determination (Spanos, A., and Hubscher, u. (1983) H.eth. En 2. 
91*53-277) following SDS-PAGE gel electrophoresis. This form apoears 
to be proteolytic artifact that may arise during sample handling. 

2 This lower molecular weight form was purified to homogeneity and 
subjected to N-terainal sequence determination on an A3I automated gas 
phase sequencer. Comparison of the obtained N-tenninal sequence with 
the predicted amino acid sequence of the Taq polymerase gene (see 
Figure 1) indicates this shorter form arose as a result of proteolytic 

10 cleavage between glu 28 g and ser 2go . 

To obtain a further truncated form of a Taq polymerase gene 
that would direct the synthesis of a 544 amino acid primary 
translation product we used plasmids pFC54.t, pSYC1578 and the 
complementary synthetic oligonucleotides DG29 (5'- 

15 AGCTTATGTCTCCAAAAGCT) and DG30 ( 5 ' -AGCTTTTGGAGACATA) . Plasmid pFC54.t 
was digested to completion with Hindlll and BamHI. Plasmid pSYC1578 
was digested with BstXI and treated with E^ col i DNA polymerase I 
Klenow fragment in the presence of all 4 dNTPs to remove the 4 
nucleotide 3' cohesive end and generate a CTG- terminated duplex blunt 
end encoding leu 294 in the Taq polymerase sequence (see pLSGl, 
nucleotide 880). The DNA sample was digested to completion with Bgl II 
and the approximate 1.5 kb BstXI (repaired J/BgH I Taq DNA fragment was 
purified by agarose gel electrophoresis and electrocution. The 
pFC54.t plasmid digest (0.1 pmole) was ligated with the Taq polymerase 
gene fragment (0.3 pmole) and annealed nonphosphorylated DG29/DG30 
duplex adaptor (0.5 pmole) under sticky ligase conditions at 30/tg/ml, 
15 C overnight. The DNA was diluted to approximately 10 microgram per 
ml and ligation continued under blunt end conditions. The ligated DNA 
sample was digested with Xba l to linearize (inactivate) any IL-2 
mutein-encoding ligation products. 80 nanograms of the ligated and 
digested DNA was used to transform coli K12 strain DG116 to 
ampicillin resistance. Arap R candidates were screened for the presence 
of an approximate 7.17 kb plasmid which yielded the expected digestion 
products with EcoRI (4,781 bp + 2,386 bp), PstI (4,138 bp ♦ 3,029 bp), 
Aoal (7,167 bp) and HindlH/Pstl (3,400 bp + 3 , 029 bp + 738 bp). E^ 
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col i colonies harboring candidate plasmids were screened as above by 
single colony inuiunoblot for the tsmperature-inducible synthesis of an 
approximate 61 Jed Taq polymerase related oolypeotide. In addition, 
candidate plasmids were subjected to DNA sequence detennination at the 
5' lambda ? L promoter:Taq DNA junction and the 3' Taq 0NA:3T crv_ PRE 
junction. One of the plasmids encoding the intended DNA sequence and 
directing the synthesis of a temperature- induci bl e 61 kd Taq 
polymerase related polypeptide was designated pLSG8. 

Yet another truncated Taq po1>merase gene contained within 
the ^2.68 kb HindIII-Aso718 fragment of plasmid pFC85 can be expressed 
using, for example, plasmid pP L N RBS ATG, by operably linking the amino- 
terminal Hindi II restriction site encoding the Taq £ol_ gene to an ATG 
initiation codon. The product of this fusion upon expression will 
yield an <W0, 000-72, 000 dalton truncated polymerase. 

This specific construction can be made by digesting plasmid 

pFC85 with Hind I II and treating with Klenow fragment in the presence 

of dATP and dGTP. • The resulting fragment is treated further with SI 

nuclease to remove any single-stranded extensions and the resulting 

* 

ONA digested with Asp718 and treated with Klenow fragment in the 
presence of all four dNTPs. The recovered fragment can be li gated 
using T4 ONA ligase to dephosphorylated plasmid pPi_Nrb$ atg « which had 
been digested with SacI and treated with Klenow fragment in the 
presence of dGTP to construct an ATG blunt end. This ligation mixture 

« 

can then be used to transform E. coli DG116 and the transformants 
screened for production of Taq polymerase. Expression can be 
confirmed by Western inmunoblot analysis and activity analysis. 

- EXAMPLE X 

Construction of Amp R trp promoter operator, trpL ribosome binding 
site, Taq polymerase (832) ST cry PRE cop ts expression vector 

To substitute the E^ coli trp operon promoter/operator and 
leader peptide ribosome binding site, we used plasmids pLSG5 and 
pFC52. pFC52 was the source of the trp promoter, cop t s and ampicillin 
resistant determinants. However, plasmid pCS4, described in U.S. 



Patent No. 4,711,345, supra , the disclosure of which is incoroorated 
herein by reference, may be used to provide the identical fragment. 
Plasmid pLSG5 was digested to completion with Sph l. The Soh_I was 
inactivated (70°C, 10 minutes) and the digested DNA was ligated 
overnight at 15*C with an excess of annealed nonphosphorylated DG27/23 
duplex adaptor (see above). The T4 DMA ligase was inactivated (/O^C, 
10 minutes) and the DNA digested to completion with Mlul. The ONA 
sample was sequentially extracted with phenol and ether, ethanol 
precipitated and finally resuspended in 10 mM Tris chloride pH S, 1 mM 
EOTA. Plasmid pFC52 (or pCS4) was digested to completion with Hlul 
and extracted with phenol, ether and concentrated as above. The DNA 
sample was digested to completion with H^ndll! and the Hindlll 
inactivated (75° C, 15 minutes). The pLSG5 and pFC52 samples were 
ligated overnight in equal molar ratio and at 30 /ig/ml under sticky 
end conditions. The T4 ligase was inactivated (70°C, 10 minutes) and 
the ligated DNA was digested with Xba.1 to linearize (inactivate) any 
IL-2 encoding ligation products (from the pFC52 unwanted, 1.65 kb 
Hindlll/Mlu; DNA fragment). col_i_ K12 strain DG116 was transformed 
to ampicillin resistance with 30 nanogram of the ligated DNA. Amp R 
colonies were screened for the presence of approximate 7.78 kb 
plasraids which yielded the expected digestion products with EcoR I 
(4,781 bp + 3,002 bp). S£hl (7,783 bp), Hindlll (7,162 bp + 621 bp), 
Clal (7,783 bp) and CUI/MluI (3,905 bp + 3,878 bp). Candidate 
colonies were further screened for expression of an approximate 90 kd 
Taq polymerase related protein by single colony SDS-PAGE 
immuno blotting (as above). Plasraids from two of the candidates 
showing the intended properties were transformed into coli K12 
strain KB2 (ATCC No. 53075). 

By Western imraunoblot, both plasraids in both hosts were 
shown to direct the synthesis of an approximate 90 kd Taq polymerase- 
related polypeptide upon trp_ limitation. By Comassie staining of SDS- 
PAGE fractionated whole cell extract proteins, the trp_ promoter/Taq 
polymerase 'pi asmids. in E^ cp_H K12 strain KB2 direct the accumulation 
of significantly more Taq polymerase than in E^ coli K12 strain 
0G116. One of the plasraids was designated pLSGlO. 
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EXAHPLE XI 

Synthesis of Recombinant Taq DNA Polymerase Activity in E. coli 

E. coli K12 (0G11S) strains harboring plasmids pOGISC, or 
pL5G5, or pLSG6 were grown at 32*C in Sonner-Vogel minimal salts media 

5 containing 0.5S glucose, 10 Aig/ml thiamine, 0.255 (w/v) Oifco casamino 
acids and arapicillin (100 jmg/ml ) or tetracycline (10 ^g/ml ) as 
appropriate. Cells were grown to Ag 00 of about 0.8 and shifted to 37 
°C to simultaneously dereprsss the lambda P L promoter (inacti vation of 
cl 857 repressor) and increase the copy number of the Col El cop ts 

10 .plasmid vector. After six-nine hours of growth at 37°C, aliquots of 
the cells were harvested, the cells centrifuged and the pellets stored 
at -70°C. 

Alternatively, E^ col i K12 strain KB2 harboring plasmid 
pLSGlO was grown for eight hours at 32°C in Bonner-Vogel minimal salts 
15 media containing 0.5% glucose, 5/ig/ml tryptophan, lOxj/ml thiamine, 
0.251 Difco casamino acids and 100 /xg/ml ampicillin to an A^qq of 
3.0. Cells were harvested as above. 

Cell pellets were* resuspended to about 62.5 Aggg/ml (^50- 
160 yug total protein/ml) in 50 mM Tris-Cl, pH 7.5, 1 mM EDTA, 2.4 mM 

20 PMSF and 0.5 >Ug/ml leupeptin and lysed by sonication. Aliquots of the 
sonicated extracts were subjected to SDS-PAGE and analyzed by 
Coomassie staining and Western immunoblotting with rabbit polyclonal 
anti-Taq polymerase antibody. In addition, portions of the extracts 
were assayed in a high temperature (74°C) DNA polymerase assay (see 

25 Example XIII below). 

Western immunoblotting showed significant induction and 
synthesis of an approximately 94 kd Taq DNA polymerase related 
polypeptide in induced strains harboring plasmids pLSGS, 6, and 10. 
Coomassie blue staining of SDS-PAGE-separated total cell protein 
30 revaled the presence of a new predominant protein at ^94 kd in these 
induced strains. Finally, high temperature activity assays confirmed 
the significant level of recombinant Taq DNA polymerase synthesis in 
these E. coli strains (see table, below). 
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Uninduced(- } or 

Plasmid Host Tao Pol Gene Promoter Induced (+1 Units»/ 0Q g 
PDG160/D6116 - ?l _ or+ 

PLSG5/DG1I5 * o, . 23 



PLS55/DG1I5 -!■ ?. 



308 



PLSG6/0G116 + p L s 

PLSG6/0G116 + p L + 170 

PLSG10/KB2 + Trp + 300 

* 1 unit = 10 nmole total nucleotide incorporated at 74 c C/30 minutes. 
10 EXAMPLE XII 

Purification of Recombinant Tag DNA Polymerase 

E. coli strain 0G116 harboring plasmid pLSG5 was grown in a 
10 L fermentor. The medium was 10 mM (NH 4 ) 2 S0 4 , 25 mM KH 2 P0 4 , 4 mM 
- Na 3 Citrate, 400 ^ FeCl 3 , 28^ ZnCl 2 , 34 yi CoC1 2 , 33 >M NaHo0 4 , 27 
15 CaCl 2 , 30/iM CuCl 2 , and 32 A M H3BO3. The medium" was adjusted to pH 

6.5 with NaOH, 15 mM, "and sterilized. -The following—sterile 
components were added: 20 mg/1 thiamine-HCI , 3 mM MgS0 4 , 10 g/1 
glucose and 12.5 mg/1 ampicillin. The pH was adjusted to 6.8 and held 
there using NH 4 0H. Glucose was fed to the culture in conjunction with 
the alkali demand, to maintain a glucose concentration at 402 of air 
saturation, by automatic increases in rpm (350 to 1000) and airflow (2 
to 5 1/min). Foaming was controlled on demand using polypropylene 
glycol. 

The fermentor was inoculated with cells and grown to A 680 = 
5.0 (14.25 hours). The temperature was raised to 37*C to induce 
synthesis of recombinant Taq polymerase and growth continued for five 
hours to Ag 80 of 16.5. 

Unless otherwise indicated, all purification steps were 
conducted at 4°C. Twenty grams (wet weight) of induced frozen E. coli 
K12 strain 0G116 harboring plasmid pLSG5 was thawed in 3 volumes of 50 
mH Tris-Cl, pH 7.5, 1 oM E0TA. 3 mM PMSF. 0.64 /<g/ml leupeptin and 
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disrupted in a French Press at 20,000 psi. The lysate was adjusted Co 
5.5X cell volume with additional buffer and sonicsted (4 x 30 seconds) 
to reduce viscosity (Fraction I), The crude total call ;ysata was 
adjusted to 0.2 M (NH^SO^ (25.43 g/1 ) and centrifuges for 15 minutes 

3 at 20,000X3. The supernatant (Fraction II) was heated to 75°C (in a 
100 C water bath) and maintained at 72-75 C for 15 minutes to denature 
E. col i host proteins. The sample was rapidly cooled to 4°C by 
swirling in an ice water bath. After 20 minutes at 0°C, the sample 
was centrifuged at 20 f 000XG for 15 minutes to precipitate the 

10 denatured proteins. The supernatant (Fraction III) was applied at 4 
ml/hr to a 6 ml Phenyl -Sepharose CI-4B (friarmacia) column equilibrated 
with 50 mM Tris-Cl, pH 7.5, 1 mM EDTA (Buffer A) containing 0.2 M 
(NH 4 ) 2 S0 4 . The column was sequentially washed with 3-10 column 
volumes of a) the same buffer, b) Buffer A, c) Buffer A containing 20% 

15 ethylene glycol to remove nucleic acids and non-Taq pol^nerase 
proteins. Taq DNA polymerase activity was eluted with 60 ml linear 
gradient of 0-4 M urea in Buffer A containing 202 ethylene glycol. 
The active fractions (^2 M urea') were pooled (Fraction IV) and applied 
at 3 ml/hr to a 12 ml *(1.5 X 6.0 cm) Heparin- Sepharose CL-6B 

20 (Pharmacia) column equilibrated in 50 mM Tris-Cl, pH 7.5, 0.1 mM EDTA, 
0.2S Tween 20 (Buffer B) containing 0.1 M KC1. The colunn was washed 
with 2 column volumes of Buffer B containing 0.15 M KC1. The Taq 
polymerase was eluted with a 120 ml linear gradient of 0.15-0.65 M KC1 
in Buffer B. The Taq polymerase eluted as a single A230 and activity 

25 peak at **0.29 M KC1. ' 

Purified recombinant and native Taq polymerase proteins 
coraigrate following electrophoresis on SOS-PAGE and staining with 
Coomassie blue. The purified Taq polymerase proteins migrate slightly 
faster than purified Phosphorylase B (Pharmacia), consistent with a 
30 molecular weight predicted from the DNA sequence (of pLSGS) of 93,920 
dal tons. 

the peak activity fractions were pooled and a portion 
subjected to N-terrainal amino acid sequence determination on an 
Applied Biosystems gas phase sequencer. In contrast to native Taq 
35 polymerase which has a blocked amino terminus, the sequence of the 
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Purified recombinant T aq polymerase and the individual cycle vi-l ds 
were consistent wit* the scenes predicted for the amino teml^s - 
the Taq polymerase protein encoded by plasmid pLSGS. 

The recombinant Taq polymerase encoded by olasmid pi« S * >r« 

2Zo\\l s * sscribed coald anp, * fy * human «p y - s; qu ; n ; s : 

jsmg a low temoerature 1in«r ^ , * 

. =<-jrg nm.t oi 55 extension temoeraturo of ^ s r 

upper temperature l ia1t of & I and a 2-2.5 minute cycl* time' 
comparable yields and efficiency were noted for native and recc^in^ 
Taq polymerase using 1-2 units/100 ^,1 PCR. 

EXAMPLE XIII 
Purification 

The thermostable polymerase may be purified directly from a 
culture of Jhennyj. aju^ticus. following the example disclosed below or 
alternatively,, from a bacterial culture containing the recombinant^ 
1, Produced enz^e with only minor modifications necessary in the 
preparation of the crude extract. 

After harvesting by centri fugation , 60 grams of cells were 
resuspended in 75 ml of a buffer consisting of 50 mM Tris-Cl pH 8 1 
■H EDTA. Cells were lysed in a French Press at 14,000-16,000 PS I 
after which 4 volumes (300 ml) of additional Tris-EDTA were added 
Buffer A (beta-mercaptoethanol to 5 mM and NP-40 and Tween 20 to 0 55 ' 
(v/v) each) was added and the solution was sonicated thoroughly while 
cooling. The resultant homogeneous suspension was diluted further 
with Buffer A such that the final volume was 7.5-8 times the starting 
cell weight; this was designated Fraction I. 

The polymerase activity in Fraction I and subsequent 

i7,« de ? n,1ned in 3 50 ^ mixture containing °- 025 M ™ s - 

C pH 9.4 (20 C). 0.002 M H gd 2 , 0.05 M KC1. 1 mM 2-mercaptoethanol , 
0 2 mM each dGTP. dATP. TTP. 0.1 mM dCTP [Tlpha^P. . 05 ci/rtj. 
30 12 5 ^ -activated- salmon sperm DNA and 0.01-0.2 units of the 
po ymerase (diluted in 10 m „ Tn - S . HCU pH 8§ 5Q mM J 
autoclaved gelatin, 0.5X NP-40, 0.51 Tween 20. and 1 M 2- 
mercaptoethanol). One unit corresponds to 10 nmoles of product 
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synthesized in 30 minutes. "Activated" ONA is a native preparation of 
DNA after partial hydrolysis with ONase I until 32 of the DNA was 
transferred to the acid-soluble fraction. The reaction was conducted 
at 74°C for 10 ."ninutes and then 40^1 was transferred to 1.0 ml of 50 
jjig/ml carrier DNA in 2 aM £DTA at C*C. An equal volume (1.0 ml; of 
202 TCA, 22 sodium pyrophosphate was added. After 15-20 minutes at 0 
the samples were filtered through Whatman GF/C discs and extensively 
washed with cold 52 TCA-12 pyrophosphate, followed by cold 952 
ethanol , dried and counted. 

Fraction I was centrifuged for two hours at 35,000 rpm in a 
Beckman TI 45 rotor at 2°C and the collected supernatant was 
designated Fraction II. 

The Taq polymerase activity was precipitated with Polymin P 
(BRl, Gaithersburg, M0) (102, w/v, adjusted to pH 7.5 and autoclaved) 
after the minimum amount of Polymin P necessary to precipitate 90-952 
of the activity was determined, which amount was generally found to be 
between 0.252 and 0.32 final volume. 

An appropriate level of Polymin P was added slowly to 
Fraction II while stirring for 15 minutes at 0°C. This solution was 
centrifuged at 13,000 rpm for 20 minutes in a Beckman JA 14 rotor at 2 
C. The supernatant was assayed for activity and the pellet was 
resuspended in 1/5 volume of 0.5X Buffer A (diluted 1:2 with H 2 0). 
This suspension was recentrifuged and the pellet resuspended in 1/4 
volume of Buffer A containing 0.4 M KC1. This suspension was 
homogenized thoroughly and left overnight at 4°C. The homogenate was 
centrifuged as above and the collected supernatant designated Fraction 
III. 

The protein fraction was collected by "precipitation" at 752 
saturation of ammonium sulfate, centrifuged (at 27,000 rpm, SW27 
rotor, 30 minutes) and the floating pellicle was resuspended in 50 mM 
Tris-Cl pH 8, 1 mM EOTA. These steps were repeated and the protein 
suspension was dialyzed extensively with P-cell 'buffer (20 mM KP0 4 pH 
7.5, 0.5 mM EDTA, 5 mM beta-raercaptoethanol , 52 (w/v) glycerol, 0.52 
(v/v) NP-40 and Tween 20) containing 80 mM KC1. 
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The dialysate was transferred to a centrifuge bottle to 
which war. added any recovered protein from sacks ringed wit.", the ?- 
cell buffer containing 30 oiH KCl. Centri fugation was performed at 
20,000 x g and the tiae was reduced to 13 minutes. The supernatant 
was saved and any pellet remaining was washed, extracted with ?.--]' 
buffer and 80 mM KCl, and recentrifuged. The supernatants were then 
combined to form Fraction IV. 

Fraction IV was applied to a • 2.2 x 22-cm column of 
phosphocellulose, equilibrated with the P-cell buffer containing SO mM 
KCl. The column was washed (2.5-3 column volumes) with the same 
buffer and the protein eluted using a linear gradient of SO to 400 sft 
KCl in P-cell buffer. Fractions containing DNA polymerase activity 
("0.18-0.20 M KCl) were pooled and concentrated 3-4 fold on an Amicon 
stirred cell and YM30 membrane. The cell was rinsed with the P-cell 
buffer without KCl and added to the fraction concentrate (0.15 M KCl 
15 adjusted final volume) to form Fraction V. 

Fraction V was applied to a 5 ml Heparin Se pharos e CL-6B 
column (Pharmacia) equilibrated with P-cell buffer and 0.15 M KCl. 
The column was washed with -0.15 M KCl buffer (3-4 column volumes) and 
the protein eluted with a linear gradient from 0.15 to 0.65 M KCl in 
P-cell buffer. A 1:10 dilution into diluent without gelatin was made 
for SOS-PAGE analysis and a subsequent 1:20 dilution into diluent with 
1 rag/ml gelatin was made for use in enzyme assays. The activity 
fractions (eluting *t ~0.3 M KCl) were assayed on supercoiled DNA 
template for specific and non-specific endonucl eases/topoisomerase by 
electrophoretically detecting the change in molecular weight of 
supercoiled plasmid ONA after incubation with an excess of DMA 
polymerase. Exonuclease contamination was detected following 
incubation with small linear DNA fragments. In peak fractions, 
an n 88 _ 92 led protein was found to be the major band. The major pool] 
designated Fraction VI. had the highest polymerase activity with 
minimal detectable endonucl ease activity when this pool was assayed 
for 30 minutes at 55*0 with ^3-5 polymerase units/600 ng ONA. 
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Fraction VI was dialyzed against 10 mM XPO* pH 7.5, 5 mM 
beta-mercaptoethano": , zZ V. /cerol, 0.25 NP-40, and 0.25 Tween 20 (HA 
buffer). The dialyzed sample was applied to a 3 ml column of 
hydroxyapati te and the enzyme el f jted with a linear gradient of 10 zo 
5 250 mM KP0* pH 7.5, KA buffer. DNA polynerase activity began to aluts 
at 75 mM KPC 4 with the peak at 100 mM KPO^. Active peak fractions 
were assayed at i:i00-l:300 dilution. As in the prior chromatography 
step, a 1:10 dilution in diluent was prepared without gelatin for SDS- 
PAGE analysis. Fractions with no significant endonuclease or double- 
10 strand exonuclease when assayed at 55 C with 5 polymerase units were 
pooled and designated Fraction VII. 

Fraction VII was dialyzed against a solution of 25 mM sodium 

acetate pH 5.2, 52 glycerol, 5 mM beta-mercaptoethanol , OJ mM EDTA, 

0.15 NP-40, and 0.12 Tween 20, adjusted to pH 5 at room temperature. 

15 The dialyzed sample was applied to a 2 ml DEAE-Tri s-Acryl-M (LKB) 

column pre-equi 1 ibrated and subsequently washed with the same 

buffer. The fraction containing polymerase activity that did not 

adhere to the column was pooled and adjusted to 50 mM NaCl in the same 

buffer to yield Fraction VIII. 

• 

20 Fraction VIII was applied to a 2 ml CM-Tri s-Acryl M (LKB) 

column equilibrated with the same buffer (25 mM sodium acetate, 50 ntt 
NaCl, 52 glycerol, 0.1 mM EDTA, 0.12 NP-40, and 0.12 Tween 20). The 
column was washed with 4-5 column volumes of the same buffer and the 
enzyme eluted with a linear gradient from 50 to 400 mM NaCl in sodium 

25 acetate buffer. The polymerase activity peak eluted ^0.15-0.20 M 
NaCl. The pol unerase activity was assayed at 1:300 to 1:500 dilution 
with the first dilution 1:10 into diluent without gelatin for the SDS- 
PAGE analysis. An assay across the activity peak on supercoiled ONA 
templates for specific and non-specific endonuclease/topoisomerase 

30 using DNA polymerase assay salts (25 mM TAPS-HC1 pH 9.4, 2.0 mM MgCl 2 
and 50 mM KC1 ) at 74 C was performed, as well as assays for nucleases 
on H13 ss DNA and pBR322 fragments. Active fractions with no 
detectable nuclease(s) were pooled and run on a silver stained SOS- 
PAGE mini gel. The results show a single ^88-92 led band with a 

35 specific activity of <*200,000 units/mg. 
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This specific activity is more than an order or* aagnitude 
higher than that claimed for the previously isolated Taq aolvmerase 
and is at least an order of magnitude higher than that for £, col i 
polymerase I. 

EXAMPLE XIV 

The Taq polymerase purified as described aDove in Example 
XIII was found to be free of any contaminating Taq endonuclease and 
exonuclease activities. In addition, the Taq polymerase is preferably 
stored in storage buffer containing from about 0.1 to about 0.5? 
volume/ volume of each non- ionic polymeric detergent employed. More 
preferably the storage buffer consists of 502 (v/v) glycerol, ICC 
KC1, 20 mM Tris-Cl pH 8.0, 0.1 mM ethyl enedi aminetetraacetic acid 
(EDTA), 1 mM dithiothreitol , 0.52 v/v NP-40, 0.52 v/v Tween 20, and 
200A9/ml gelatin, and is preferably stored at -20°C. 

The stored Taq polymerase was diluted in a buffer consisting 
of 25 mM Tris CI pH 8.0, 20 mM KC1 , 1 mM beta-mercaptoethanol , 0.52 
NP-40, 0.52 Tween-20, and 500 >g/ml gelatin. A reaction buffer was 
then prepared containing 50 mM KC1 , 10 mM Tris-Cl, pH 8.3, 1.5 mM 
MgCl 2 , 0.012 (w/v) gelatin,. 200^ each dNTP, l^M each of the' primers 
that define a 500 base pair target sequence on a control template from 
bacteriophage lambda, and 2.0-2.5 units Taq polymerase/assay in a 
final volume of 100^1. Template was added to the reaction buffer, 
the sample placed in a 0.5 ml polypropylene tube, and the sample 
topped with 100/41 of heavy white mineral oil to prevent evaporation. 

At least a' 10 5 -fold amplification was achieved when the 
following conditions were employed, using 1 ng of control template 
(bacteriophage lambda 0NA) where the target sequence represented 
approximately IS of the starting mass of DNA. 

First the template mixture was denatured for one minute, 30 
seconds at 94"c by placing the tube in a heat bath. Then the tube was 
30 placed in a heat bath at 37°C for two minutes. Then the tube was 
placed in a heat bath at 72°C for three minutes, and then in the heat 
bath at 94*C for one minute. This cycle was repeated for a total of 
25 cycles. At the end of the 25th cycle, the heat denaturation step 
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at 94°C was omitted and replaced by extending the 7Z°C incubation steo 
by an additional three minutes. Following termination of t^e assay, 
the samples were allowed to cool to room temperature and analyzed as 
described in previous examples. 

The template may be optimally amplified with a different 
concentration of dNTPs and a different amount of Taq polymerase. 
Also, the size of the target sequence in the DNA sample will directly 
impact the minimum time required for proper extension (72 C C incubation 
step). An optimization of the temperature cycling profile should be 
performed for each individual template to be amplified, to obtain 
maximum efficiency. 

EXAMPLE XV 

Taq polymerase purified as described above in Example I was 
formulated for storage as described in the previous example, but 
without the non-ionic polymeric detergents. When assayed for activity 
as described in that example, the enzyme storage mixture was found to 
be inactive. When the NP-40 and Tween 20 were added to the storage 
buffer, the full enzyme activity was restored, indicating that the 
presence of the non-ionic detergents is necessary to the stability of 
the enzyme formulation. 

EXAMPLE XVI 

Several 1 xg samples of human genomic DNA were subjected to 
20-35 cycles of amplification as described in Example II, with 
equivalent units of either Klenow fragnent or Taq polynerase, and 
analyzed by agarose gel electrophoresis and Southern blot. The 
primers used in these reactions, PC03 and PC04, direct the synthesis 
of a 110-bp segnent of the human beta-globin gene. The Klenow 
polymerase amplifications exhibited the smear of ONA typically 
observed with this enzyme, the apparent cause of which is the non- 
specific annealing and extension of primers to unrelated genomic 
sequences under what were essentially non-stringent hybridization 
conditions (lx Klenow salts at 37° C). Nevertheless, by Southern blot 
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a specific 110-bp beta-globin target fragment was detected in a P. 
lanes. A substantial!;/ different slectrcpnoretic saltern was seen in 
the amplifications done with 7aq polymerase where the single major 
band Is the HC-bp target sequence. This remarkable specificity was 
undoubtedly due to the temperature at which the prisers were extended/ 

Although, like Klenow fragment amplifications, tne annealing 
step was performed at 37°C, the temperature of Taq-catalyzed reactions 
had to be raised to about 70°C before the enzyme exhibited significant 
activity. During this transition from 37 to 70°C poorly matched 
primer- tempi ate hybrids (which formed at 37°C) disassociated so that 
by the time the reaction reached an enzyme- activating temperature, 
only highly complementary substrate was available for extension. This 
specificity also results in a greater yield of target sequence than 
similar amplifications done with Klenow fragment because the non- 
specific extension products effectively compete for the polymerase, 
thereby reducing the amount of UO-mer that can be made by the Klenow 
fragment. 

_ EXAMPLE XVII 

Amplification was carried out of a sample containing l^g 
Molt 4 DNA, 50 mM KC1 , 10 mM Tris pH 8.3, 10 mM MgCl 2 , 0.012 gelatin, 
lyi of each of the following primers (to amplify a 150 bp region): 

5 ' -CATGCCTCTTTGCACCATTC-3 ' ( RS7 9) and 
5 ' -TGGTAGCTGGATTGTAGCTG- 3 ' ( RS80 ) 

1.5 mM of each dNTP, and 5.0 units of Taq polymerase per 100^1 
reaction volume. Three additional samples were prepared containing 
2.5, 1.3, or 0.6 units of Taq polymerase. The amplification was 
carried out in the temperature cycling machine described above using 
the following cycle, for 30 cycles: 

from 70 to 98°C for 1 minute 
30 hold at 98*C for 1 minute 

from 98 C C to 35, 45 or 55°C for 1 minute 
hold at 35, 45 or 55°C for 1 minute 

??? 35, , 4 L or 55 ° c t0 70 °C for 1 minute 
hold at 70°C for 30 seconds 



20 



25 



78 

At 35 °Z annealing temperature, the 2.5 uni ts/100 Taq 
enzyme dilution gave the best-signal-to noise ratio by agarose gel 
electrophoresis over all other Taq pol/aerase concentrations. At 
45 P C, the 5 unit3/100 ^ Taq enzyme gave the best signal- co-noise 
5 ratio over the other concentrations. At 55°C, the 5 uni ts/lCC^i Tac 
enzyme gave the best signal-to-noise ratio over the other 
concentrations and over the 45^C annealing and improved yield. The 
Taq polymerase has more specificity and better yield at 55^C. 

In a separate experiment the Molt 4 DNA was 10-fold serially 
10 diluted into the cell line GM2064 DNA, containing no beta- or delta- 
globin sequences, availaDle from the Human -Genetic Mutant Cell 
Depository, Camden, New Jersey, at various concentrations representing 
varying copies per cell, and amplification was carried out on these 
samples as described in this example at annealing temperatures of 25°C 
13 and 55 °C. At 35°C, the best that can be seen by agarose gel 
electrophoresis is 1 copy in 50 cells. At 55°C, the best that can be 
seen is 1/5,000 cells (a 100-fold improvement over the lower 
temperature), illustrating the importance of increased annealing 
temperature for Taq polymerase specificity under these conditions. 

20 In a third experiment, DNA from a cell line 358H containing 

HIV-positive DNA, available from B. Poiesz, State University of New 
York, Syracuse, NY, was similarly diluted into the DNA from the SCI 
cell line (deposited with ATCC on March 19, 1985 ; an EBV-transformed 
beta cell line homozygous for the sickle cell allele and lacking any 

25 HIV sequences) at various concentrations representing varying copies 
per cell, and amplification was carried out as described in this 
Example at annealing temperatures of 35°C and 55 C C, using the primers 
SK38 and SK39, which amplify a 115 bp region of the HIV sequence: 

5 • -ATAATCCACCTATCCCAGTAGGAGAAAT-3 ' ( SK38) and 
30 5 • -TTTGGTCCTTGTCTTATGTCCAGAATGC-3 1 ( SK3 9) 

The results by agarose gel electrophoresis showed that only 
the undiluted 368H sample could be detected with the annealing 
temperature at 35°C, whereas at least a 10"^ dilution can be detected 
with the annealing temperature at 55°C, giving a 100-fold improvement 
35 in detection. 
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The following bacteriophage and bacterial strains we-° 
deposited with the American Type Culture Collection, 12301 ? 5 rH% 
Drive. RockvUle. Maryland, USA (ATCC). These deposits we- ,. d . 
under the provisions of the Budapest Treaty cn the International 
Recognition of the Deposit of Microorganisms for purposes of Patent 
Procedure and the Regulations thereunder (Budapest Treaty). 
Oeposi t 



Oesi qnation 
CH35:Taq#4-2 

_E. coli 0G98/ 
PFC83 

c. col i 0G98/ 
PFC85 



N-jN, 



col i DG95 ( « 7 „ 53 
cI 857 su sP 80 )/pFC54.t 
E. coli DG116/pAW74QrHR 3291 



CMCC No. 

3125 

3128 

3127 

2103 



ATCC No 

40366 

67422 

67421 

39789 

67505 



Pesos i t 
5/29/37 
5/29/87 

5/2 9/37 
8/7/84 

1/12/88 



so 

WHAT IS CLAIMED IS: 



1. A gene encoding a purified native thermostac] e SNA 
oolymerase from Thermus acuaticus having a molecular weight •:: S5- 
95,000 daltons and having at least half of the activity at oH 5.4 that 
it has at pH 8,0. 

2. The gene of claim 1 that was cloned from the genome of 
Thermus aquaticus . 

3. The gene of claim 2 that has the ONA sequence of Figure 
1 or an allelic variant thereof. 



4. The gene of claim 2 encoding a polymerase having a 
molecular weight of about 86,000-95,000 daltons. 

5. The gene of claim 4 encoding a polymerase having the 
amino acid residues of 4-832 of Figure 1. 

6. The gene of Claim 4 encoding a polymerase having the 832 
15 amino acid sequence of Figure 1. 

7. The gene of claim 2 encoding a polymerase having a 
molecular weight of about 60,000-55,000 daltons. 

8. The gene of claim 7 encoding a polymerase having the 
amino acid residues 290-832 of Figure 1. 

20 9. A thermostable enzyme that is a polymerase containing at 

least 501 homology to any contiguous stretch of nine or more amino 
acids shown in Figure 1. 

10. The thermostable polymerase of claim 9 wherein said 
contiguous stretch of nine or more amino acids is selected from the 
25 following sequences: 
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a; residues 190-204 ; 
b) residues 252-270; 



cj residues 559-587; 

4) residues 713-732; 

s; residues 743-759; and 

f) residues 778-7 90. 



claim 1. 



11. The enzyme produced recombinant^ from the 



gene of 



erminus 



12. The enzyme of claim 11 which has a non-blocked amino 



claim 4. 



13. The enzyme produced recombi nantl y from the gene of 



claim 7 



14. The enzyme produced recombinant^ from the gene of 



15. A stable enzyme composition comprising the enzyme of 
claim 11 i„ a buffer comprising one Qr mQre non . ionic 

detergents. 

16 A method for purifying a thermostable polymerase which 
compnses treating an aqueous mixture containing the thermostable 
polymerase with a hydrophobic interaction support under conditions 
which promote hydrophobic interactions and eluting said thermostable 
polymerase from said support with a solvent which attenuates 
hydrophobic interactions. 



1/. The method of claim 16 wherein the hydrophobic 
chromatographic support is Phenyl Sepharose. 
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IS. The method of claim 16 wherein said hydrophobic 
interactions are provided by a buffer with an ionic strength 
correspond! ng to greater than or equal to 0.05 M NaCl . 

19. The method of claim 13 wherein said r.y^rcshcbic 
5 interaction promotion conditions are provided using a buffer 

containing greater than or equal to 0.2 M ammonium sulfate. 

20. The method of claim 16 wherein said elution solvent 
uses a 0-4 M urea gradient. 

21. The method of claim 15. wherein the thermostable 
10 polymerase is ONA polymerase isolated from Thennus aouaticus . 

22. The method of claim 15 wherein said thermostable 
polymerase is a recombinant enzyme. 

23. The method of claim 22 wherein the aqueous mixture has 
previously been enriched in thermostable polymerase activity by heat 

15 treating the cell lysate. 

24. The method of claim 23 wherein the heat treatment is 
conducted at temperatures in the range of at least 45 C to about 90 C. 

25. A method for purifying a recombinant thermostable 
polymerase produced in a heat labile host cell which method comprises 

20 treating the cell lysate with temperatures in the range of at least 45 
C to about 90 C and recovering the thermostable polymerase activity. 



26. The method of claim 25 wherein said thermostable 
polymerase is from Thermus aquaticus. 
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TAQ DNA POLYMERIAS Z SEQDZNCZ 



-120 -100 -30 

Bglll PvuIZ 

* • * . - «, 

AAGCTC^GATCTACCTGCCTGAGGGCGTCCGGTTCC/lGCrGGCCCTTCCCGAGGGGGAGA 



-60 -40 -20 

* * • • • m 

GGGAGGCGTTTCTAAAAGCCCTTCAGGACGCTACCCGGGGGCGGGTGGTGGAAGGGTAAC 
1 20 40 60 

• • * • ♦ • « 

ATGAGGGGGATGCTGCCCCTCTTTGAGCCCAAGGGCCGGGTCCTCCTGGTGGACGGCCAC 
MetArgGlyMetLeuProLeuPheGluProLysGlyArgValLeuLeuValAspGlyHis 
1 

80 100 120 

• • • * • • 

CACCTGGCCTACCGCACCTTCCACGCCCTGAAGGGCCTCACCACCAGCCGGGGGGAGCCG 
HisLeuAlaTyrArgThrPheHisAlaLeuLysGlyLeuThrThrSerArgGlyGluPro 

140 160 180 

• • • * • • 

GTGCAGGCGGTCTACGGCTTCGCCAAGAGCCTCCTCAAGGCCCTCAAGGAGGACGGGGAC 
ValGlnAlaValTyrGlyPheAlaLysSerLeuLeuLysAlaLeuLysGluAspGlyAsp 
41 

200 220 240 

• • * • • • 

GCGGTGATCGTGGTCTTTGACGCCAAGGCCCCCTCCTTCCGCCACGAGGCCTACGGGGGG 
AlaVallleValValPheAspAlaLysAlaProSerPheArgHisGluAlaTyrGlyGly 

260 280 300 

• * • • • • 

TACAAGGCGGGCCGGGCCCCCACGCCGGAGGACTTTCCCCGGCAACTCGCCCTCATCAAG 
TyrLysAlaGlyArgAlaProThrProGluAspPheProArgGlnLeuAlaLeuIleLys 
81 

320 340 360 

Xhol 

• * • * • • 

GAGCTGGTGGACCTCCTGGGGCTGGCGCGCCTCGAGGTCCCGGGC7ACGAGGCGGACGAC 
GluLeuValAspLeuLeuGlyLeuAlaArgLeuGluValProGlyTyrGluAlaAspAsp 



FIG.I- 
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TAQ DNA POLYMERASE SEQUENCE 

380 400 420 

GTCCTGGCCAGCCTGGCCAAGAAGGCGGAAAAGGAGGGCTACGAGGTCCGCATCCTCACC 
ValLeuAlaSerLeuAlaLysLysAlaGluLysGluGIyTyrGiuVaiArcIIeLeuThr 
m 

*m « 

4 40 4 60 4 60 

GCCGACAAAGACCTTTACCAGCTCCTT^CCGACCGCATCCACGTCCTCCACCCCGAGGGG 
AlaAspLysAspLeuTyrGlnLeuLeuSerAspArglleHisValLeuHisProGluGly 

500 520 540 

Asp 728 




Ty 
161 



560 580 600 



GACTACCGGGCCCTGACCGGGGACGAGTCCGACAACCTTCCCGGGGTCAAGGGCATCGGG 
AspTyrArgAlaLeuThrGlyAspGluSerAspAsnLeuProGlyValLysGlylleGly 

620 640 660 

Hindlll . . . • 

GAGAAGACGGCGAGGAAGCTTCTGGAGGAGTGGGGGAGCCTGGAAGCCCTCCTCAAGAAC 
GluLysThrAlaArgLysLeuLeuGluGluTrpGlySerLeuGluAlaLeuLeuLysAsn 

201 

680 700 720 

CTGGACCGGCTGAAGCCCGCCATCCGGGAGAAGATCCTGGCCCACATGGACGATCTGAAG 
LeuAspArgLeuLysProAlalleArgGluLysIleLeuAlaHisMecAspAspLeuLys 

740 760 780 

CTCTCCTGGGACCTGGCCAAGGTGCGCACCGACCTGCCCCTGGAGGTGGACTTCGCCAAA 
LeuSerTrpAspLeuAlaLysValArgThrAspLeuProLeuGluValAspPheAlaLys 

241 

800- 820 840 

^ • 

AGGCGGGAGCCCGACCGGGAGAGGCTTAGGGCCTTTCTGGAGAGGCTTGAGTTTGGCAGC 

ArgArgGluProAspArgGluArgLeuArgAlaPheLeuGluArgLeuGluPheGlySer 
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TAQ DNA POLYMERASE SEQUENCE 



860 880 300 

£stXI 

CTCCTCCACGAGTTCGGCCTTCTGGAAAGCCCCAHGGCCCrG^GGAGGCCCCCTGGCCC 

LeuLeuHisGluPheGiyLeuLeuGltiSerProLvsAiaLeuGluGiuAlaProTr-cPrc 
281 290 

920 940 960 

* • * • • . 

CCGCCGGAAGGGGCCTTCGTGGGCTTTGTGCTTTCCCGCAAGGAGCCCATGTGGGCCGAT 

ProProGluGlyAlaPheValGlyPheValLeuSerArgLysGluProMetTrpAiaAsp 

980 1000 1020 

* • * • • „ 

CTTCTGGCCCTGGCCGCCGCCAGGGGGGGCCGGGTCCACCGGGCCCCCGAGCCTTATAAA 

LeuLeuAlaLeuAlaAlaAlaAxgGlyGlyArgValHisArgAlaProGluProTyrLys 
321 

1040 1060 1080 

* • • • • • 

GCCCTCAGGGACCTGAAGGAGGCGCGGGGGCTTCTCGCCAAAGACCTGAGCGTTCTGGCC 
AlaLeuArgAspLeuLysGluAlaArgGlyLeuLeuAlaLysAspLeuSerValLeuAla 



1100 1120 1140 

• • • • • * 

CTGAGGGAAGGCCTTGGCCTCCCGCCCGGCGACGACCCCATGCTCCTCGCCTACCTCCTG 

LeuArgGluGlyLeuGlyLeuProProGlyAspAspProMetLeuLeuAlaTyrLeuLeu 
361 

1160 1180 1200 

• • • • * « 

GACCCTTCCAACACCACCCCCGAGGGGGTGGCCCGGCGCTACGGCGGGGAGTGGACGGAG 
AspProSerAsnThrThrProGluGlyValAlaArgArgTyrGlyGlyGluTrpThxGlu 

1220 1240 " 1260 

• * * • • • 

GAGGCGGGGGAGCGGGCCGCCCTTTCCGAGAGGCTCTTCGCCAACCTGTGGGGGAGGCTT 

GluAlaGlyGluArgAlaAlaLeuSerGluArgLeuPheAlaAsnLeuTrpGlyArgLeu 
401 

1280 1300 1320 

* • • • • • 

GAGGGGGAGGAGAGGCTCCTTTGGCTTTACCGGGAGGTGGAGAGGCCCCTTTCCGCTGTC 
GluGlyGluGluArgLeuLeuTrpLeuTyrArgGluValGluArgProLeuSerAlaVal 



FIG.I-3 
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TAQ DNA POLYMERASE SEQUENCE 



1340 1360 13S0 

• * ■ • 

C _ Iso 1 — w .^'^irt.ljvjv.. -_i_»=^i*j\j:\j . j\^rJ.>~-3 - -;\js»'w— -k- _-iL:^vc — _ _.». 

Le'X^iaHisMetGiuAlaThrGiyValArgLeuAspValAiaTyrLeuArgAlaLeuSer 
441 * 

1400 1420 1440 

Xhol 

. • • 

CTGGAGGTGGCCGAGGAGATCGCCCGCCTCGAGGCCGAGGTCTTCCGCCTGGCCGGCCAC 

LeuGluValAlaGluGluIlsAlaArgLeuGliiAlaGluValPheArgLe'jAlaGlyHis 



1460 1430 1500 

Pvu II 

CCCTTCAACCTCAACTCCCGGGACCAGCTGGAAAGGGTCCTCTTTGACGAGCTAGGGCTT 
ProPheAsnLeuAsnSerArgAspGinLeuGluArgValLauPheAspGluLeuGlyLeu 
481 

1520 1540 1560 

* - • • * 

CCCGCCATCGGCAAGACGGAGAAGACCGGCAAGCGCTCCACCAGCGCCGCCGTCCTGGAG 
ProAlalleGlyLysThrGluLysThrGlyLysArgSerThrSerAlaAlaValLeuGlu 



1580 1600 1620 

PstI SacI 
• . . « ♦ • 

GCCCTCCGCGAGGCCCACCCCATCGTGGAGAAGATCCTGCAGTACCGGGAGCrCACCAAG 
AlaLeuArgGluAlaHisProIleValGluLysIleLeuGlnTyrArgGluLeuThrLys 
521 

1640 1660 1680 

• • • * • • 

CTGAAGAGCACCTACATTGACCCCTTGCCGGACCTCATCCACCCCAGGACGGGCCGCCTC 
LeuLysSerThrTyrlleAspProLeuProAspLeuIleHisProArgThrGlyArgLeu 



1700 1720 - 1740 

• • * • * • 
CACACCCGCTTCAACCAGACGGCCACGGCCACGGGCAGGCTAAGTAGCTCCGATCCCAAC 
HisThrArgPheAsnGlnThrAlaThrAlaThrGlyArgLeuSerSerSerAspProAsn 
561 

1760 1780 1800 

BamHI 

• • • * * * 

CTCCAGAACATCCCCGTCCGCACCCCGCTTGGGCAGAGGArCCGCCGGGCCTTCATCGCC 

LeuGlnAsnlleProValArgThrProLeuGlyGlnArglleArgArgAlaPhelleAla 
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TAQ DMA - POLYMERASE SEQUENCE 



1320 1840 i860 

Sad 

GAGGAGGGGTGGCTATTGGTGGCCCTGGACTATAGCCAGATAGAGCTCAGGGTGCTGGCC 

GluGluGlyTrpLeuLeuValAlaLeuAspTyrSerGlnlleGluLeuAraValLeuAia 
601 ~ — ~ 



1880 1900 1S20 

• . 

CACCTCTCCGGCGACGAGAACCTGATCCGGGTCTTCCAGGAGGGGCGGGACATCCACACG 
HisLeuSerGlyAspGluAsnLeuIleArgValPheGlnGluGlyArgAspIleHisThr 

1940 I960 1980 

PvuII 
♦ 

GAGACCGCCAGCrGGATGTTCGGCGTCCCCCGGGAGGCCGTGGACCCCCTGATGCGCCGG 

GluThrAlaSerTrpMetPheGlyValProArgGluAlaValAspProLeuMetArgArg 
641 



2000 2020 2040 

GCGGCCAAGACCATCAACTTCGGGGTCCTCTACGGCATGTCGGCCCACCGCCTCTCCCAG 
AlaAlaLysThrlleAsnPheGlyValLeuTyrGlyMetSerAlaHisArgLeuSerGln 

2060 2080 2100 

Nhel 

• 

GAGCrAGCCATCCCTTACGAGGAGGCCCAGGCCTTCATTGAGCGCTACTTTCAGAGCTTC 

GluLeuAlalleProTyrGluGluAlaGlnAlaPhelleGluArgTyrPheGlnSerPhe 
681 



2120 2140 2160 

* 

CCCAAGGTGCGGGCCTGGATTGAGAAGACCCTGGAGGAGGGCAGGAGGCGGGGGTACGTG 
ProLysValArgAlaTrpIleGluLysThrLeuGluGluGlyArgArgArgGlyTyrVal 

2180 2200 2220 

GAGACCCTCTTCGGCCGCCGCCGCTACGTGCCAGACCTAGAGGCCCGGGTGAAGAGCGTG 
^"TlirLeuPheGlyArgArgArgTyrValProAspLeuGluAlaArgValLysSerVal 
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TAQ DMA POLYMERASE SEQUENCE 



2240 2260 2280 

- 

CGGGAGGCGGCCGAGCGCATGGCCTTCAACATGCCCGTCCAGGGCACCGCCGCCGACCTC 
ArgGluAlaAlaGluArgMetAlaPheAsnMetProValGlnGlyThrAlaAlaAspLeu 

741 



2300 2320 2340 

ATGAAGCTGGCTATGGTGAAGCTCTTCCCCAGGCTGGAGGAAATGGGGGCCAGGATGCTC 
MetLysLeuAlaMetValLysLeuPheProArgLeuGluGluMetGlyAlaArgMetLeu 



2360 2380 2400 

Xhol 

CTTCAGGTCCACGACGAGCTGGTCCTCGAGGCCCCAAAAGAGAGGGCGGAGGCCGTGGCC 
LeuGlnValHisAspGluLeuValLeuGluAlaProLysGluArgAlaGluAlaValAla 

781 



2420 2440 2460 

• • • * * 

CGGCTGGCCAAGGAGGTCATGGAGGGGGTGTATCCCCTGGCCGTGCCCCTGGAGGTGGAG 

ArgLeuAlaLysGluValMetGluGlyValTyrProLeuAlaValProLeuGluValGlu 



2480 2500 

• • • • 

GTGGGGATAGGGGAGGACTGGCTCTCCGCCAAGGAGTGATACCACC 

ValGlylleGlyGluAspTrpLeuSerAlaLysGluEnd 

821 832 
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